J e? 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Boreas 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 : 

C12N 7/00, 7/01, 15/00, 15A1, 15/86, 
A61K 39/00, 39/29 



Al 



(11) International Publication Number: 
(43) Inte rnation al Publication Date: 



WO 94/12617 

9 June 1994 (09.06.94) 



(21) International Application Nnmber: PCT/US93/1 1474 

(22) International Fifing Date: 24 November 1993 (24.11.93) 



(30) Priority Data: 

982,211 



25 November 1992 (25.1 1.92) US 



I 



(71) Applicant: INTERNATIONAL BIOTECHNOLOGY LABO- 
RATORIES, INC [US/US]; 67 Rogers Street, Cambridge 
MA 02142 (US). 

(72) Inventory SOUW, Peter, T„ S4 4 56 Belmont Street, #12, 

Watertown, MA 02172 (US). O'KEEFE, Rhonda, Wilson; 
57 Gleo Drive, Hudson, NH 03051 (US). LEWIS, Tatyana; 
49 Gkndale Road, Newton, MA 02159 (US). BERNSTTNE, 
Edward, G.; 376 Commercial Street, Boston, MA 02109 
(US). 

(74) Agents: MISROCK, S„ Leslie ct aL; Pcnnie & Edmonds, 1155 
Avenue of the Americas, New York, NY 10036 (US). 



(81) Designated States: AU. BB, BG, BR, BY, CA, CZ, FL HU 
JP, KR, KZ.LK LV, MG, MN, MW, NO, N2L PL RO. 
RU, SD, SK, UA, TJZ, VN, European patent (AT, BE, CHL 
DE, DK. ES, FR, GB, GR, IE, IT, LU. MC. NL, PT, SE), 
OAH patent (BF, BJ, CF. CG, CL CM. GA, GN, ML, MR, 
NE, SN, TD, TGX 



Published 

Wi/A international search report 



(54) Title: HEPATITIS B VIRUS VACCINES 



Met 

-4- 



e antigen 



-10 0 



1 

149 

leavage at 5' 
and 3' termini 



159 amino acids 
secreted 



Met 



Met 



precore protein 



-29 



transcription 
£ translation 



— |214 amino acids 
165 localized to 
endoplasmic 
reticulum 



ATG 



ATG 
1 



1 



TAG 



DNA 



transcription 
€ translation 



Met 



core protein 



185 



(subtype adw) 



159 amino acids 
not secreted 



17:57:02 



(57) Abstract 

The present invention provides recombinant vaccinia viruses which are capable of expressing a plurality of immunogenic HBV 
epi topes comprising at least one core antigen epitope, at least one surface antigen epitope, or any combination of the foregoing. Preferred 
viruses containing plural HBV antigens include bat are not limited to a combination of one of S, MS, LS, and full-length core, coreA8 
(a core deletion mutant), preSl-careAS, preSl -complete core, core-preSl*, coro-prcS2, or core-S* (the a tt r m lr s denoting the presence of 
an immunogenic fragment of the preceding antigen; hyphens ^enotzng a fusion protein). As shown in the figure, the core and/or surface 
antigen epitopes can be expressed by the same recombinant virus, under the control of different promoters active in vaccinia virus. 
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HEPATITIS B VIRUS VACCINES 

lJ INTRODUCTION 
The present invention relates to recombinant 
5 vaccinia viruses which are capable of expressing a 
plurality of hepatitis B virus (HBV) epitopes 
comprising core antigen epitopes, surface antigen 
epitopes, or any combination of the foregoing. The 
vaccinia viruses are useful in vaccine formulations to 
10 prevent or treat hepatitis or other undesirable 
consequences of HBV infection. 

2. BACKGROUND OF THE INVENTION 

15 2.1. HEPATITIS B VIRUS 

Hepadna viruses , which include human 
hepatitis B virus (HBV) (Barker et al., 1975, Am. J. 
Med. Sci. 270:189-196), are hepatropic, lead to 
persistent infection, and have a common structure. 

20 Large concentrations of hepadnaviruses have been 
detected in the blood of infected organisms. 
Hepadnavirus virions are approximately 42 nm in 
diameter and have a double-walled structure consisting 
of an envelope and nucleocapsid (reviewed in Ganem and 

25 Various, 1987, Ann. Rev. Biochem. 56:651-693 and 

Tiollais et al., 1985, Nature (London) 317:489-495)* 
The outer coat, or envelope, contains three surface 
antigens, designated major or small (S) , middle (MS) , 
and large (I*S) , as well as carbohydrates and lipids. 

30 Hepatitis B surface antigen (HBsag) is found on all 
three of these surface antigen proteins. The 
nucleocapsid, or inner coat, contains a circular DNA 
(3.0-3.3 kb in length), a DNA polymerase, protein 
kinase activity, and hepatitis B core antigen (HBcAg) . 
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The hepadnavirus genome is a small, 



circular, partly double stranded DNA molecule 
(reviewed in Ganem and Varmus, supra, and Tiollais et 
al., supra). The minus strand is linear and of a 
5 fixed length, 3-3.2 kb. In contrast, the plus strand 
is of variable length, ranging from 50-100% of that of 
the minus strand. 



sequences of the cloned HBV genome indicated that the 

10 minus strand contains four major open reading frames 
(ORFs) , s, C, P, and X (reviewed in Ganem and Varmus, 
supra, and Tiollais, supra) . ORF S encodes a total of 
389-400 amino acids, depending upon subtype. ORF S is 
divided into the preSl region (encoding 108-119 amino 

15 acids) , preS2 region (encoding 55 amino acids) , and 
S gene (encoding 226 amino acids) in 5' to 3' order, 
each of which begins with an ATG codon capable of 
functioning as a translation initiation site in vivo. 
Gene S encodes small (S) antigen, the HBsAg, which is 

20 the major envelope protein. A 500 nucleotide sequence 
upstream of the S gene contains the preSl and preS2 
sequences. Middle (MS) protein is encoded by the 
preS2 and S regions, and large (LS) protein is encoded 
by the preSl, preS2 and S regions. ORF C, which codes 

25 for HBcAg, is divided into the C gene and pre-c 

region. ORF P encodes the viral polymerase. ORF X 
encodes a protein which enhances expression of the 
other viral genes. 



30 complex process involving two different precursors, 

precore and core, and post translational truncation at 
both amino and carboxy termini (Fig. 1) - Two main 
forms of core-related antigens . result from this 
processing: e antigen, which is made from precore and 

35 found mainly extracellularly in infected individuals, 



The results from analyses of nucleotide 



Core-related antigen production in vivo is a 
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20 



25 



30 



and core, which is found almost: exclusively 
intracellularly , usually within the nucleus of the 
cell. Putative nuclear localization sequences of the 
hepatitis B core protein include a set of direct 
repeats having the amino acid sequence PKRRRSQS (SEQ 
ID NO: 1) located in tandem in the protamine -like 
domain of core. Yen et al. (1990, J* Virol. 64:6141- 
6147) suggest that these repeats are necessary for 
nuclear localization. Bckhardt et al* (1991, J. 
Virol. 65:575-582) suggest that other sequences may" 
also be involved. 

The structure and expression of the 
hepatitis B viral genome has been studied by 
recombinant DNA techniques. U.K. Patent Publication 
No. GB 2034323A (published June 4, 1980) discloses the 
cloning and expression of HBV DNA. Moriarty et al. 
(1981, Proc. Natl. Acad. Sci. USA 78:2606-2610) 
describes the construction and expression of a 
recombinant simian virus containing a fragment of HBV 
DNA encoding the HBsAg; 22 nm surface antigen 
particles were shown to be excreted into the cell 
culture medium. European Patent Publication No. 
0020251 (published December 10, 1980) describes the 
construction of and expression from recombinant 
expression vectors encoding HBV proteins. European 
Patent Publication No. 0013828 (published August 6, 
1980) discloses recombinant DNA vectors which contain 
fragments of HBV DNA, isolated from Dane particles, 
which encode proteins with HBV antigenicity. 



widespread and serious health problem resulting from 
infection by the hepatitis B virus. A large 
proportion of people living in regions of poor medical 



2.2. 



BIOLOGY OF HBV INFECTION 



The liver disease called hepatitis B is a 
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care are chronically infected with the virus and also 
face an elevated risk of acquiring liver cancer, since 
HBV is associated with liver carcinoma. HBV is 
endemic in Asia, and causes one of the largest health 
5 problems in that region, where about 10% of the 
population are chronically infected carriers. The 
toll of this figure on the health of Asians is severe; 
in addition to the impact of acute HBV infection, 
chronic carriers are atL.vastly increased risk from 
10 cirrhosis and primary hepatocellular carcinoma (PHC)*. 
PHC is the second most prevalent cancer in Asia, 
accounting for up to 22% of cancer deaths in some 
countries . 

HBV infection has been observed to be highly 

15 polymorphic, ranging from inapparent forms in which 
individuals experience mild or no liver injury to 
acute hepatitis B, a moderately severe illness 
characterized by hepatocellular injury and 
inflammation to severe chronic liver disease (reviewed 

2 0 in Ganem and Varmus, 1987, Ann. Rev. Biochem. 56:651- 
693). People infected with HBV are often unaware of 
it. After a two- to six-month incubation period, HBV 
infection can lead to acute hepatitis and liver 
damage, which cause abdominal pain, jaundice, elevated 

25 levels of certain enzymes in the blood and other 

symptoms. At this stage, hepatitis B viral infection 
can be diagnosed by detecting hepatitis B surface 
antigen (HBsAg) in a patient 1 s blood serum. More 
frequently, the disease remains permanently 

30 asymptomatic- Although many chronic carriers appear 
healthy, they can still transmit the hepatitis B virus 
to those with whom they have close contact, thereby 
starting the cycle of disease anew. 

Variation in host immune response to virus 

35 infected cells appears to be a major determinant in 
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the degree of severity of liver damage in individuals. 
Both humoral and cellular immune responses to HBcAg 
and HBsAg have generally been observed during acute 
and chronic HBV infection (reviewed in Robinson, 1986, 
5 Fundamental Virology, Fields, B.N. and Knipe, D.M. 
(eds.)f Raven Press, N.Y., pp. 657-679). 



liver disease, results from epidemiological and 
molecular biological studies indicate a connection 

10^ between hepadnavirus infection and hepatoma (liver 
cancer) (reviewed in Ganem and Varmus, supra; 
Machowiak, 1987, Am. J. Med. 82:79-97; and Tiollais et 
al., 1985, Nature (London) 317:489-495. There is a 
strong correlation between high rates of chronic HBV 

15 infection and incidence of hepatoma, e.gr. , in 
Southeast Asia and equatorial Africa. 



recover completely from the disease. After recovery, 
the patient continues to produce low levels of 

2 0 antibodies specific for HBV, and thus is immune to HBV 
for several years. In the event of a new HBV attack, 
the antibody level rises quickly and neutralizes the 
virus. In chronically-infected patients, the virus 
survives in the liver, and may cause cirrhosis and, 

25 eventually, hepatocarcinoma . Chronic infection occurs 
in approximately 5-10% of infected adults and 80% of 
infected infants. Hepatitis B is, therefore, 
primarily a disease of infants in developing 
countries, and is mostly confined to adults in Western 

30 countries. 



Current methods used to treat HBV infection 
may be divided into two categories, antivirals and 
immunomodulatory agents (reviewed in Hoofnagle, Ann. 



In addition to causing acute and chronic 



Usually a patient with acute hepatitis will 



2.3. 



VACCINATION AND THERAPIES 
FOR HBV INFECTION 
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Int. Med. 107:414-415). One analog that has been 
widely tested is vidarabine (adenine arabinoside) , an 
adenine analog with potent activity against 
herpesviruses. While the inhibition of viral 
5 replication was observed to be nearly complete and 
sustained in some patients, the inhibition of 
replication was partial and transient in other 
patients. One drawback involved in using this drug is 
that it is highly water insoluble and has to be 
10 administered intravenously as a constant infusion. A 
monophosphorylated derivative of vidarabine, 
vidarabine phosphate, can be administered by rapid 
intravenous infusion or intramuscularly. Even though 
vidarabine phosphate therapy resulted in the clearance 
15 of serum hepatitis- B virus DNA, it did not lead to a 
sustained improvement in the accompanying liver 
disease. Other antivirals that have been tested for 
their effectiveness in treating HBV infection include 
acyclovir and suramin. 



patients were treated with alpha-interferons for 1-6 
months. it was found that only 25-40% of the patients 
responded to this therapy. The combination of 
vidarabine phosphate and human leukocyte interferon 
25 proved to be toxic. Other therapies that have been 
studied include the administration of inter leukin-2 , 
gamma interferon, and short course corticosteroids. 



vaccine comprising a 22 nm polypeptide particle made 
30 of mature hepatitis B surface antigen. 



report that 2-4% of the general population are 
nonresponsive to the existing recombinant hepatitis 
vaccine, which contains only the S antigen. Milich et 
35 al. (1986, J. Immunol. 137:315) suggest that this lack 



20 



Studies have also been undertaken in which 



U.S. Patent No. 4,471,901 discloses a 



Szmuness et al. (1981, J. Med. Virol. 8:123) 
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of an anti— S response may be overcome by immunizing 
with preS epitopes in combination with S. Neurath et 
al. (KPO publication no. 243,913) disclose an HBV 
vaccine which includes amino acid sequences from 
5 within the preS region of the HBV surface antigen 
linked to a lipid vesicle. Other groups disclose 
various surface antigen constructs for vaccine use 
(U.S. Patent No. 4,816,564; U.S. Patent No. 4,683,136; 
PCT international Publication No. *iP 8810301) . 

10 , Immunization of chimpanzees with recombinant or 

purified HBcAg has been shown to have some protective 
effect against HBV challenge (Tabor and Geretz, 1984, 
Lancet 1:172; Murray et al., 1984, EMBO J. 3:645). 
However, vaccines employing purified HBV antigens are 

15 disadvantageous due to the laborious purification 
procedures required. 



consisting of fusions to the core antigen expressed in 
bacteria, yeast or vaccinia (see, e.g., U.S. Patent 

20 No. 4,859,465; U.S. Patent Nos. 4,818,527 and 

4,882,145). In some cases core particles are formed 
which expose the foreign antigenic determinants on 
their surface (Clarke et al., 1987, Nature 
330:381-384; Borisova et al., FEBS Letters 

25 259:121-124). 



vaccinia virus recombinants expressing HBV surface 
antigens have been reported (Smith et al., 1983, 
Nature 302:490-495; Cheng and Moss, 1987, J. Virol. 
30 61(4) :1286-1290; Cheng et al. , 1986, J. Virol. 

60(2) :337-344; Paoletti et al., 1984, Proc. Natl. 
Acad. Sci. USA 81:193-197; Moss et al., 1984, Nature 
311:67-69; U.S. patent application Serial No. 
6-445,892 filed December 1, 1982). 



A number of .groups have made constructs 



Potential live vaccines consisting of 



35 
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2.4. VACCINIA VIRUS 
Vaccinia virus is a poxvirus (for a review, 



see Moss, 1990, Virology, 2d ed. , Ch. 74, Fields et 
al., eds.. Raven Press, Ltd., New York, pp. 2079-2111) 
5 which replicates within the cytoplasm of infected 
cells. Vaccinia virus contains a linear double- 
stranded genome of approximately 187 kilobase pairs, 
encoding approximately 200 proteins. A large number 
of proteins are encoded by the virus because the virus 
10 establishes infection in the cytoplasm of the cell 

rather than in the nucleus, and therefore must provide 
its own set of enzymes to replicate its DNA and to 
transcribe its genes. 



15 involving the use of vaccinia virus as a vector to 
express foreign genes inserted into its genome 
(Mackett et al. , 1982, Proc. Natl. Acad. Sci. USA 
79:7415-7419; Mackett et al. , 1984, J. Virol. 49:857- 
864; Panicali and Paoletti, 1982, Proc. Natl. Acad. 

20 Sci. USA 79:4927-4931; PCT International Application 
No. W0 91/12318; European Patent Publication No. 
83,286; Murphy, 1989, Res. Virol. 140:463-491; Moss 
and Flexner, 1987, Ann. Rev. Immunol. 5:305-324; 
Hruby, 1990, Clin. Microbiol. Rev. 3 (2) : 153-170 ; U.S. 

25 Patent No. 4,603,112; U.S. Patent No. 4,722,848; U.S. 
patent application Serial No. 7-072,455 filed July 13, 
1987). Upon introduction into host animals, the 
recombinant vaccinia virus expresses the inserted 
foreign gene and may thereby elicit a host immune 

30 response to such gene products. 

3. SUMMARY OF THE INVENTION 
The present invention provides recombinant 
vaccinia viruses which are capable of expressing a 
35 plurality of immunogenic HBV epitopes comprising at 



A vaccine approach has been described, 



17:57:02 



WOSM/12617 PCT/US93/11474 

- 9 - 



least: one core antigen epitope, at least one surface 
antigen epitope, or any combination of the foregoing. 
In one embodiment, a plurality of core and/or surface 
antigen epitopes is expressed. Core antigen epitopes 
5 are selected from the group consisting of wild-type 
core antigen, e antigen* and derivatives thereof. 
Surface antigen epitopes are those encoded by ORF S, 
and are selected from the group consisting of S, MS, 
LS, preS2, preSl, and derivatives there of.. "Epitope"*. 

10 as used herein refers to an antigenic determinant, 

i.e., an amino acid sequence that is capable of being . 
immunospecif ically bound by an antibody. Thus, for 
example, a protein containing a core antigen epitope 
can be detected by observing the ability of such 

15 protein to be bound by an anti-core protein antibody. 
The core antigen derivatives provided by the invention 
are those derivatives (including but not limited to 
fragments) which display core antigen antigenicity, 
i.e., are capable of being bound by an anti-core 

20 antibody. Similarly, the surface antigen derivatives 
provided by the invention are those derivatives 
(including but not limited to fragments) which display 
surface antigen antigenicity, i.e., are capable of 
being bound by an ahti-surface antigen antibody. 

25 In a specific embodiment, a combination of 

the foregoing core and/or surface antigen epitopes are 
expressed as a fusion protein in the vaccinia viruses 
of the invention. "Fusion protein," as used herein, 
refers to a protein comprising an amino acid sequence 

3 0 from a first protein covalently linked via a peptide 
bond at its carboxy terminus to the amino terminus of 
an amino acid sequence from a second, different 
protein. 

In one embodiment, a vaccine formulation of 
35 the invention contains a single type of recombinant 
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vaccinia virus of the invention. In another 
embodiment, a vaccine formulation comprises a mixture 
of two or more recombinant viruses of the invention . 
Preferred viruses containing plural HBV antigens 
5 include but are not limited to a combination of one of 
S, MS, LS, and full-length core, coreA8 (see Section 
6.1.2 infra) , preSl-coreA8 , preSl-complete core, 
core-preSl* , core-preS2, or core-S* (the asterisks 
denoting the presence of an immunogenic fragment of 
10 the preceding antigen; hyphens denoting a fusion 
protein) . 



surface antigen epitopes can be expressed by the same 
recombinant virus, under the control of different 

15 promoters active in vaccinia virus. In a preferred 
aspect, two divergently oriented promoters are used. 
In a specific embodiment, a recombinant vaccinia virus 
expresses a surface-core (written in the amino to 
car boxy —terminal order) or core-surface fusion protein 

20 (e.g., core-preSl, core-preS2, core-S, or preSl-core) 
under the control of a first promoter, and expresses a 
surface antigen (e.g. , MS) under the control of a 
second promoter. 



25 recombinant vaccinia viruses is provided. In one 
embodiment, such a mixture comprises a first virus 
capable of expressing core-preSl, a second virus 
capable of expressing core-preS2, and a third virus 
capable of expressing core-S or core-S*. In another 

30 embodiment, such a mixture comprises a first virus 
capable of expressing core-preSl or core-preSl* , a 
second virus capable of expressing core-preS2 , and a 
third virus capable of expressing S. In another 
embodiment, such a mixture comprises a first virus 

35 capable of expressing core-preSl or core— preSl*, and a 



In various embodiments, the core and/ or 



In another specific embodiment, a mixture of 



17:57:02 



WO 94/12617 PCT/US93/11474 

- 11 - 



second virus capable of expressing MS. In yet another 
embodiment , such a mixture comprises a first virus 
capable of expressing core-preS2, and a second virus 
capable of expressing S. 

5 

4. DESCRIPTION OF THE FIGURES 
Figure 1. Schematic diagram of HBV precore 
and core processing to produce core antigen and 
e antigen. - 

10 Figure 2. Nucleotide sequence (SEQ ID NO; 2) 

of the p7.5 promoter. 

Figure 3. Schematic diagram of plasmid 

pGS53 . 

Figure 4. Nucleotide sequence (SEQ ID NO: 3) 
15 of the pll promoter. 

Figure 5. Schematic diagram of plasmid 

pSClO. 

Figure 6. Nucleotide sequence (SEQ ID NO: 4) 
of the modified p7.5 promoter. 
20 Figure 7. Schematic diagram of plasmid 

pSC59. 

Figure 8. Schematic diagram of the HBV 
genome, illustrating the core and surface antigen- 
encoding regions. 
25 Figure 9. Schematic diagram of plasmid 

pAM6. 

Figure 10. Schematic diagram of plasmid 

PT7T318. 

Figure 11. Schematic diagram of plasmid 
30 pRO-01. A restriction enzyme site shown in 

parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 12. Schematic diagram of plasmid 
35 pRO-02. A restriction enzyme site shown in 
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parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 13. Schematic diagram of plasmid 

5 pRO-03. 

Figure 14.. Schematic diagram of plasmid 
pRO-05. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was .destroyed during 
10 cloning procedures. 

Figure 15. Schematic diagram of plasmid 
pRO— 06. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
15 cloning procedures * 

Figure 16. Schematic diagram of plasmid 
pT7T3/S. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
20 cloning procedures. 

Figure 17. Schematic diagram of plasmid 

PT7T3/S2. 

Figure 18. Schematic diagram of plasmid 

pLEH-01. 

25 Figure 19. Schematic diagram of plasmid 

pI*EH-02. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

30 Figure 20. Schematic diagram of plasmid 

pLEH-03. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 
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Figure 21. Schematic diagram of plasmid 
pLEH-04 . A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
5 cloning procedures. 

Figure 22. Nucleotide sequence (SEQ ID 
NO: 5) and amino acid sequence (SEQ ID NO: 6) of I*S. 

Figure 23 . Nucleotide sequence (SEQ ID 
NO: 7) and amino acid sequence- (SEQ ID NO: 8) of MS. 
10 Figure 24. Nucleotide sequence (SEQ ID 

NO: 9) and amino acid sequence (SEQ ID NO: 10) of S. 

Figure 25* Schematic diagram of plasmid 
pRO-09B. A restriction enzyme site shown in 
parentheses means that the site was present in the 
15 parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 26. Schematic diagram of plasmid 
PT7T3/C. A restriction enzyme site shown in 
parentheses means that the site was present in the 
20 parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 27, Schematic diagram of plasmid 

PT7T3/CHB. 

Figure 28. Schematic diagram of plasmid 

25 pT7T3/CRB. 

Figure 29. Schematic diagram of plasmid 
pT7T3 / CODM . 

Figure 30. Nucleotide sequence (SEQ ID 
NO: 12) and amino acid sequence (SEQ ID NO: 13) of 
3 0 coreAS. 

Figure 31. Schematic diagram of plasmid 
pCRB/24. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
35 cloning procedures. 
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Figure 32. Schematic diagram of plasmid 

pCODM/24. 

Figure 33. * Nucleotide sequence (SEQ ID 
NO: 14) and amino acid sequence (SEQ ID NO: 15) of full- 
5 length core in plasmid pCODM/24. 

Figure 34. Schematic diagram of plasmid 

PGS53X. 

Figure 35. Schematic diagram of plasmid 

PGS53/S. 

10 Figure 36. Schematic diagram of plasmid 

PGS53/S2. 

Figure 37. Schematic diagram of plasmid 

pGS53/Sl. 

Figure 38. Schematic diagram of plasmid 

15 pHTL-8 . 

Figure 39. Schematic diagram of plasmid 

pHTL-9 . 

Figure 40. Schematic diagram of plasmid 

pHTL-10. 

2 0 Figure 41. Schematic diagram of plasmid 

pHTL-lOM. 

Figure 42. Schematic diagram of plasmid 
pHTL-25. A restriction enzyme site shown in 
parentheses means that the site was present in the 
25 parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 43. Schematic diagram of plasmid 

pHTL-26. 

Figure 44. Nucleotide sequence (SEQ ID 
30 NO: 25) and amino acid sequence (SEQ ID NO: 26) of 
core-preSl*. 

Figure 45. Schematic diagram of plasmid 

pHTL-2 7. 
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Figure 46. Nucleotide sequence (SEQ ID 
NO:29) and amino acid sequence (SEQ ID NO: 30) of 
core-preS2 . 

Figure 47* Schematic diagram of plasmid 

5 pHTL-28. 

Figure 48* Nucleotide sequence (SEQ ID 
NO: 33) and amino acid sequence (SEQ ID NO: 34) of 
core-S*. 

Figure 49. Schematic diagram of plasmid 
lo pSClOAlacZ. 

Figure 50* Schematic diagram of plasmid 
pDPV. A restriction enzyme site shown in parentheses 
means that the site was present in the parental 
plasmid construct but was destroyed during cloning 
15 procedures . 

Figure 51. Schematic diagram of plasmid 

pDPV-01. 

Figure 52. Schematic diagram of • plasmid 

pRO-10. 

20 Figure 53. Schematic diagram of plasmid 

pRO-16. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures . 

25 Figure 54. Schematic diagram of plasmid 

pRO-ll. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

30 Figure 55. Schematic diagram of plasmid 

pRO-17. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but .was destroyed during 
cloning procedures. 

35 
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Figure 56. Schematic diagram of plasmid 
pRO-22. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
5 cloning procedures. 

Figure 57. Schematic diagram of plasmid 

pRO-12 . 

Figure 58. Schematic diagram of plasmid 
pRO-13B. A restriction enzyme site- shown in 
10 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 59. Schematic diagram of plasmid 
pRO-18. A restriction enzyme site shown in 
15 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 60. Schematic diagram of plasmid 

pRO-19 . 

20 Figure 61. Schematic diagram of plasmid 

pRO-19/24. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

25 Figure 62. Schematic diagram of plasmid 

pHTL-5. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

30 Figure 63. Schematic diagram of plasmid 

pHTL-6. 

Figure 64 . Schematic diagram of plasmid 

pHTL-7 . 

Figure 65. Schematic diagram of plasmid 

35 pHTL-11. 



17:57:02 



WO 94/12617 PCT/US93/11474 

- 17 - 



Figure 66. Schema-tic diagram of plasmid 

pHTL-12 . 

Figure 67 . Schematic diagram of plasmid 
pRO-16M. A restriction enzyme site shown in 
5 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 68. Schematic diagram of plasmid 

pHTL-30. 

*° Figure 69 - Schematic diagram of plasmid 

pHTL-13 . 

Figure 70. Schematic diagram of plasmid 
pHTIi-15. a restriction enzyme site shown in 
parentheses means that the site was present in the 
15 parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 71. Schematic diagram of plasmid 
pHTL-17. A restriction enzyme site shown in 
parentheses means that the site was present in the 
2 0 parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 72. Schematic diagram of plasmid 

pHTL-3 3. 

Figure 73. Schematic diagram of plasmid 

25 pHTL-3 4. 

Figure 74. Schematic diagram of plasmid 

pHTL-35. 

Figure 75. Schematic diagram of plasmid 

pHTL-36. 

30 Figure 76. Schematic diagram of plasmid 

pPB-05. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but. was destroyed during 
cloning procedures. 
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Figure 77. Schematic diagram of plasmid 

pPB-09. 

Figure 78- Schematic diagram of plasmid 
pET-3d. A restriction enzyme site shown in 
5 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 79. Schematic diagram of plasmid 
pT7/core. A restriction enzyme site shown in 
10 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 80- Schematic diagram of plasmid 

pPB-04 . 

15 Figure 81. Nucleotide sequence (SEQ ID 

NO:44) and amino acid sequence (SEQ ID NO:45) of 
preSl-coreA8 . 

Figure 82. Schematic diagram of plasmid 
pRO-21. A restriction enzyme site shown in 

20 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 83. Nucleotide sequence (SEQ ID 
NO: 48) and amino acid sequence (SEQ ID NO: 49) of 

25 preSl-full-length core. 

Figure 84. Schematic diagram of plasmid 
pRO-23. A restriction enzyme site shown in • 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 

30 cloning procedures. 

Figure 85. Schematic diagram of plasmid 
pHTL-23. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 

35 cloning procedures. 
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Figure 86. Schematic diagram of plasmid 
pHTL— 24. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
5 cloning procedures. 

Figure 87. Schematic diagram of plasmid 
pHTL— 18 . A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct hut was destroyed during 
10 cloning procedures. 

Figure 88. Schematic diagram of plasmid 
pHTL-14. A restriction enzyme site shown in 
parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
15 cloning procedures. 

Figure 89. Schematic diagram of plasmid 

pHTL-31. 

Figure 90. Schematic diagram of plasmid 
pHTL— 32. A restriction enzyme site shown in 

20 parentheses means that the site was present in the 
parental plasmid construct but was destroyed during 
cloning procedures. 

Figure 91. A representative Western blot of 
polypeptides in cells infected by recombinant vaccinia 

25 viruses, probed with anti-S antibodies. Lane A 
contains proteins isolated from nonrecombinant 
(wild-type) vaccinia virus. Lane B contains proteins 
from a recombinant virus which does not express any 
S-related antigens. Lanes C (pRO-18) and D (pGS53/Sl) 

30 are samples from recombinant viruses expressing the 
two forms of the LS antigen, nonglycosylated and 
glycosylated polypeptides of expected molecular 
weights 39 kX> and 42 )cD. Lanes- E (pGS53/S) and F 
(pHTL-8) contain proteins from recombinant viruses 

35 expressing the two forms of the S antigen, with 
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nonglycosylated and glycosylated forms having expected 
molecular weights of 24 kD and 27 kD. Lanes G 
(pHTL-14), H (pGS53/S2) and I (pRO-23) contain 
proteins from recombinant viruses expressing the two 
5 forms of the MS antigen, with singly and doubly 

glycosylated forms pf expected molecular weight 33 kD 
and 36 kD, respectively, 

5. nRTATIjED DESCRIPTION OF T HE . INVENTION. 

10 The present invention provides recombinant 

vaccinia viruses which are capable of expressing a 
plurality of immunogenic HBV epitopes comprising at 
least one core antigen epitope, at least one surface 
antigen epitope, or any combination of the foregoing. 

15 In one embodiment, a plurality of core and/or surface 
antigen epitopes is expressed. Core antigen epitopes 
are selected from the group consisting of wild-type 
core antigen, e antigen, and derivatives thereof. 
Surface antigen epitopes are those encoded by ORF S, 

20 and are selected from the group consisting of S, MS, 
LS, preS2, preSl, and derivatives thereof. "Epitope" 
as used herein refers to an antigenic determinant, 
i.e., an amino acid sequence that is capable of being 
immunospecif ically bound by an antibody. Thus, for 

25 example, a protein containing a core antigen epitope 
can be detected by observing the ability of such 
protein to be bound by an anti-core protein antibody. 
The core antigen derivatives provided by the invention 
are those derivatives (including but not limited to 

3 0 fragments) which display core antigen antigenicity, 
i.e., are capable of being bound by an anti-core 
antibody. Similarly, the surface antigen derivatives 
provided by the invention are those derivatives 
(including but not limited to fragments) which display 

35 
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surface antigen antigenicity, i.e., are capable of 
being bound by an anti-surface antigen antibody. 

In a specific embodiment, a combination of 
the foregoing core and/or surface antigen epitopes is 
5 expressed as a fusion protein in the vaccinia viruses 
of the invention* "Fusion protein," as used herein, 
refers to a protein comprising an amino acid sequence 
from a first protein c ova lent ly linked via a peptide 
bond at its carboxy terminus to the amino terminus of 
10 an amino acid sequence from a second, different 
protein. 

In one embodiment, a vaccine formulation of 
the invention contains a single type of recombinant 
vaccinia virus of the invention. In another 

15 embodiment, a vaccine formulation comprises a mixture 
of two or more recombinant viruses of the invention. 
Preferred viruses contain plural HBV antigens 
including but not limited to a combination of one of 
S, MS, I*S, and full-length core, coreA8 (see Section 

20 6.1.2 infr-a) , preSl-coreA8, preSl-complete core, 

core-preSl*, core-preS2, or core-S* (the asterisks 
denoting the presence of an immunogenic fragment of 
the preceding antigen; hyphens denoting a fusion 
protein) . 

25 in various embodiments, the core and/or 

surface antigen epitopes can be expressed by the same 
recombinant virus, under the control of different 
promoters active in vaccinia virus. In a preferred 
aspect, two divergently oriented promoters are used. 

30 In a specific embodiment, a recombinant vaccinia virus 
expresses a surface-core (written in the amino to 
carboxy-terminal order) or core-surface fusion protein 
(e.g., core-preSl, core-preS2, core-S, or preSl-core) 
under the control of a first promoter, and expresses a 
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surface antigen (e.g., MS) under the control of a 
second promoter. 

In another specific embodiment, a mixture of 
recombinant vaccinia viruses is provided, comprising a 
5 first virus capable of expressing core-preSl, a second 
virus capable of expressing core-preS2, and a third 
virus capable of expressing core-S. In another 
embodiment, such a mixture comprises a first virus 
capable of expressing core-preSl or core-preSl* , a 

10 second virus capable of expressing core-preS2, and a 
third virus capable of expressing S- In another 
embodiment, such a mixture comprises a first virus 
capable of expressing core-preSl or core-preSl* , and a 
second virus capable of expressing MS. In yet another 

15 embodiment, such a mixture comprises a first virus 
capable of expressing core-preS2, and a second virus 
capable of expressing S. 

Other specific embodiments include the 
recombinant viruses disclosed in Section 6 Intra. 

2 0 The vaccine formulations of the present 

invention provide efficacious protection against HBV 
infection, because, as live viral vaccines, 
multiplication of the vaccine strain and expression of 
its HBV epitope (s) in the host leads to prolonged 

25 immune stimulus, and the combination of HBV 

immunogenic epitopes thereby expressed leads to 
protective immunity. The prolonged immune stimulus 
provided by the live viral vaccines of the invention 
is of similar kind and magnitude to that occurring in 

30 natural subclinical infections, and therefore, can 
confer substantial long- lasting immunity. The 
identity and structural context of the immunogenic 
epitopes expressed according to the invention provide 
protection against HBV infection. Thus, the present 

35 invention provides methods of prevention or treatment 
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of HBV infection and its clinical manifestations 
(hepatitis, hepatoma) comprising administering one or 
more of the recombinant vaccinia viruses of the 
invention* 



invention provide one or more of the following 
benefits: stability for long periods without 
refrigeration; ease of production; low cost of 
production; ability to be adminieteiped by local 

10 workers without advanced medical training; and 

effectiveness in one dose* The vaccinia viruses which 
express an immunogenic fragment of more than one HBV 
protein cause immunological reactions against more 
components of HBV than can be presented by single 

15 subunit vaccines. 

5.1. DNA SEQUENCES ENCODING HBV EPITOPES 
Any DNA sequence which encodes an 
immunogenic (capable of provoking an immune response) 

20 HBV epitope, which produces protective immunity 
against hepatitis when expressed as a fusion or 
nonfusion protein in a recombinant vaccinia virus of 
the invention, alone or in combination with other HBV 
epitopes expressed by the same or a. different vaccinia 

25 virus recombinant, can be isolated for use in the 
vaccine formulations of the present invention. 



antigens can be identified by virtue of their 
hydrophilicity, by carrying out a hydrophilicity 

30 analysis (Hopp and Woods, 1981, Proc. Natl. Acad. Sci. 
USA 78:3824) to generate a hydrophilicity profile. A 
hydrophilicity profile can be used to identify the 
hydrophobic and hydrophilic regions of a protein and 
the corresponding regions of the gene sequence which 

35 encode such proteins. 



5 



Furthermore, the vaccine formulations of the 



In a specific embodiment, epitopes of HBV 
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Preferred epitopes are those of the surface 
and core antigens described supra, specific 
embodiments of which are disclosed in examples section 
6 herein (see also Table I) . However, many different 
5 HBV surface and core antigen proteins or peptides can 
be expressed. These include but are not limited to 
naturally occurring membrane glycoproteins produced 
from recombinant vaccinia infected cells, which are 
glycosylated and may be transported to the cell 

10 surface. Peptides or proteins comprising surface 
antigen epitopes include but are not limited to an 
approximately 42,000 dalton glycoprotein consisting of 
preSl-preS2-S; an -39,000 dalton unglycosylated 
protein consisting of preSl-preS2-S; an -36,000 dalton 

15 glycoprotein (glycosylated at two sites) consisting of 
preS2-S; an -33,000 dalton glycoprotein (glycosylated 
at one site) consisting of preS2-S; an -27,000 dalton 
glycoprotein (glycosylated at one site) consisting of 
S; and an -24,000 dalton unglycosylated protein 

20 consisting of S. 

The immunogenic peptides and proteins 
expressed by the recombinant viruses of the invention 
can be secreted or not secreted by a host cell 
infected with the virus. Such proteins which are not 

25 secreted can be intracellular proteins or cell surface 
membrane proteins. In a preferred aspect, the HBV 
protein or peptide (containing the immunogenic 
epitope) expressed by the recombinant vaccinia virus 
is localized extracellularly (secreted) or is a cell 

30 surface protein. However, extracellular or cell 
surface localization is not required, since 
intracellular localization can also evoke an effective 
immune response (see, e.g., Brown et al. , 1987, J. 
Inf. Disc. 155:86). 
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The sequences encoding the immunogenic 
peptides or proteins are preferably present in single 
copies, but can also be present in multiple copies 



within the vaccinia virus genome. If multiple copies 
5 are present, care must be taken to ascertain that the 
recombinant virus stably maintains each of the 
multiple copies. Xn a preferred aspect , in an 
embodiment in which two copies are present, stability 
of the copies in the genome is maintained by localising 
10 the individual copies relatively distant from each other 
in the genome with essential sequences between the 
copies, or, perhaps, by orienting the copies 
divergently. Stability of the copies in the genome can 
be confirmed by methods known in the art, e.g.. Southern 
15 analysis. In a specific embodiment, multiple copies are 
avoided by employing, in the recombinant vaccinia 
viruses of the invention, HBV sequences which do not 
overlap by (i.e., do not contain an identical sequence 
of) more than 20 nucleotides, and preferably, do not 
20 overlap by more than 10 nucleotides. In a particular 
embodiment wherein a recombinant virus of the invention 
encodes a first and a second protein, each protein 
containing core and/qr surface antigen epitopes, to 
maintain stability, the core and/or surface antigen 
25 sequences, or portions thereof displaying the 

antigenicity of such core and surface antigen sequences, 
respectively, in the first fusion protein are not 
present in the second fusion protein. 



30 sequence encoding the peptide or proteins comprising 
HBV immunogenic epitope (s) lacks introns, e.g., is a 
cDNA sequence, since any introns will not be spliced 
out. Thus, for example, no introns containing stop 
codons in the desired reading frame may be contained 

35 in the HBV-encoding nucleotide sequence. The 



In a preferred aspect, the nucleotide 
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nucleotide sequence preferably contains a ribosomal 
binding site, or is placed within an insertion vector 
(see Section 5.2) so as to be operatively linked to a 
ribosomal binding site. 
5 The DNA sequence encoding the HBV epitope (s) 

can be obtained from any. of numerous sources such as 
cloned HBV DNA, genomic HBV DNA, cDNA of HBV RNA, or 
chemically synthesized DNA (see, e.g., U.K. Patent 
Publication No. GB 2034323A--putelished^June- 4 , 1980; 

X0 Ganem and Vanaus, 1987, Ann- Rev. Biochem. 56:651-693; 
Tiollais et al. , 1985, Nature 317:489-495), and 
manipulated by recombinant DNA methodology well known 
. in the art (see Sambrook et al. , 1991, Molecular 
Cloning, A Laboratory Manual, 2d ed. , Cold Spring 

15 Harbor Laboratory Press, New York) . 

Plasmid pAM6 contains the entire genome of 
HBV (subtype adw) , and is available from the American 
Type Culture Collection (ATCC) . In this plasmid, the 
HBV DNA has been inserted into the cloning vector 

2 0 pBR322 at a BamHI restriction enzyme site within the 

preS2 region to allow replication of the DNA in E • 
coll. Thus, the MS and LS antigen open reading frames 
(ORFs) in pAM6 are interrupted. In order to express 
the MS and LS antigens, it is necessary to reunite the 
25 surface antigen ORFs (e.g., as described in the 
examples sections infra) . 

In order to generate HBV DNA fragments, the 
HBV DNA may be cleaved at specific sites using various 
restriction enzymes. Alternatively, one may use 

3 0 DNasel in the presence of manganese, or mung bean 

nuclease (McCutchan et al., 1984, Science 225:626), to 
fragment the DNA, or the DNA can be physically 
sheared, as for example, by sonication. The linear 
DNA fragments can then be separated according to size 
35 by standard techniques, including, but not limited to, 
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agarose and polyacrylamide gel electrophoresis and 
column chromatography. 

Any restriction enzyme or combination of 
restriction enzymes may be used to generate HBV DNA 
5 fragment (s) containing the desired epitope (s) , 

provided the enzymes do not destroy the immunopotency 
of the encoded product. Consequently, many 
restriction enzyme combinations may be used to 
generate DNA fragments which , when inserted into an 
10 appropriate vector, are capable of directing the 
production of the peptide containing the HBV 
epitope (s) . 

Once the DNA fragments are generated, 
identification of the specific DNA fragment containing 
15 the desired HBV sequence may be accomplished in a 

number of ways. For example, if a small amount of the 
desired DNA sequence or a homologous sequence is 
previously available, it can be used as a labeled 
probe (e.g., nick translated) to detect the DNA 
20 fragment containing the desired sequence, by nucleic 

acid hybridization. Alternatively, if the sequence of 
the derived gene or gene fragment is known, isolated 
fragments or portions thereof can be sequenced by 
methods known in the art, and identified by a 
25 comparison of the derived sequence to that of the 
known DNA or protein sequence. Alternatively, the 
desired fragment can be identified by techniques 
including but not limited to mRNA selection, making 
cDNA to the identified mRNA, chemically synthesizing 
30 the gene sequence (provided the sequence is known) , or 
selection on the basis of expression of the encoded 
protein (e.g., by antibody binding) after "shotgun 
cloning" of various DNA fragments into an expression 
system. 
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The sequences encoding HBV peptides to be 
expressed in recombinant vaccinia viruses according to 
the present invention*, whether produced by recombinant 
DNA methods, chemical synthesis, or purification 
5 techniques, include but are not limited to sequences 
encoding all or part (fragments) of the amino acid 
sequences of HBV-specific antigens, as well as other 
derivatives and analogs thereof. In a specific 
embodiment, the derivative or analog is functionally 

10 active, i.e., capable of exhibiting one or more 

functional activities associated with a full-length, 
wild-type HBV antigen (e.g., core or surface antigen). 
In a preferred embodiment, such derivatives or analogs 
which have the desired immunogenicity or antigenicity 

15 (capable of being bound by an antibody to the HBV 
antigen) can be encoded. In another specific 
embodiment, a surface antigen derivative which retains 
the ability to assemble into 22 nm particles is 
encoded. In yet another specific embodiment, a core 

20 antigen derivative which retains the ability to 

assemble into core particles (see, e.g., Cohen and 
Richmond, 1982, Nature 296:677-678) is encoded. In 
yet another embodiment, core and/or surface antigen 
derivatives which retain the ability to assemble into 

25 viral particles are encoded. Derivatives or analogs 
of HBV antigens can be tested for the desired activity 
by procedures known in the art, including but not 
limited to standard immunoassays. 

In particular, HBV antigen derivatives can 

3 0 be made by altering the encoding HBV antigen 

nucleotide sequences by substitutions, additions or 
deletions that provide for functionally equivalent 
molecules. Due to the degeneracy of nucleotide coding 
sequences, other DNA sequences which encode 

35 substantially the same amino acid sequence as a native 
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HBV gene or portion "thereof may be used in the 
practice of the present invention. These include but 
are not limited to nucleotide sequences comprising all 
or portions of HBV genes or cDNAs which are altered by 
5 the substitution of different codons that encode a 
functionally equivalent amino acid residue within the 
sequence/ thus producing a silent change. For 
example, one or more amino acid residues within the 
sequence can be substituted by -another amino «acid of a 

10 similar polarity which acts as a functional 
equivalent, resulting in a silent alteration. 
Substitutes for an amino acid within the sequence may 
be selected from other members of the class to which 
the amino acid belongs. For example, the nonpol'ar 

15 (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine. The polar neutral amino 
acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine. The positively 

20 charged (basic) amino acids include arginine, lysine, 
and histidine. The negatively charged (acidic) amino 
acids include aspartic and glutamic acid. 

Derivatives or analogs of HBV antigens 
include but are not limited to those peptides which 

25 are substantially homologous to an HBV antigen or 

fragments thereof, or whose encoding nucleic acid is 
capable of hybridizing to an HBV antigen-encoding 
nucleic acid sequence, e.g., under conditions of high 
stringency (e.g., 0.1 X SSC, 65 °C) . 

30 The HBV antigen derivatives and analogs can 

be produced by various methods known in the art. For 
example, a cloned HBV gene sequence can be modified by 
any of numerous* strategies known in the art (Sambrook 
et al., 1989, Molecular Cloning, A Laboratory Manual, 

35 2d ed., Cold Spring Harbor Laboratory, Cold Spring 
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Harbor, New York) . The sequence can be cleaved at 
appropriate sites with restriction endonuclease (s) , 
followed by further enzymatic modification if desired, 
isolated, and ligated in vitro. In the production of 
5 the gene encoding a derivative or analog of an HBV 
antigen, care should be taken to ensure that the 
modified gene remains within the same translational 
reading frame as the antigen, uninterrupted by 
translational stop signals, in the gene region where 

10 the desired HBV epitope (s) are encoded. 

Additionally, the HBV antigen-encoding 
nucleic acid sequence can be mutated in vitro or in 
vivo, to create and/ or destroy translation, 
initiation, and/or termination sequences, or to create 

is variations in coding regions and/or form new 

restriction endonuclease sites or destroy preexisting 
ones, to facilitate further in vitro modification. 
Any technique for mutagenesis known in the art can be 
used, including but not limited to, in vitro site- 

20 directed mutagenesis (Hutchinson, C. , et al. , 1978, J. 
Biol. Chem 253:6551), use of TAB® linkers (Pharmacia), 
etc. 

In another specific embodiment, the encoded 
HBV antigen derivative is a chimeric, or fusion, 

25 protein comprising an HBV protein or fragment thereof 
fused to a non-HBV amino acid sequence. Such a 
chimeric protein is encoded by a nucleic acid encoding 
the HBV coding sequence joined in-frame to a non-HBV 
coding sequence. Such a chimeric product can be made 

30 by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each 
other by methods known in the art, in the proper 
coding frame . 

A DNA sequence encoding an HBV epitope which 

3 5 is a hapten, i.e., a molecule that is antigenic in 
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that it can react selectively with cognate antibodies, 
but not immunogenic in that it cannot elicit an immune 
response when administered without adjuvants or 
carrier proteins, can be isolated for use in the 
5 vaccine formulations of the present invention, since 
it is envisioned that, in particular embodiments, 
presentation by the vaccinia viruses of the invention 
can confer immunogenicity to the hapten expressed by 
the virus. 

10 Once identified and isolated, the HBV DNA 

containing the sequence (s) of interest is then 
inserted into a recombinant vaccinia virus such that 
it can be expressed by such virus. Preferably, this 
is accomplished by first inserting the HBV DNA into a 

X5 plasmid vector which is capable of subsequent transfer 
to a vaccinia virus genome by homologous recombination 
(see Section 5.2 infra) . 

5.2. CONSTRUCTION OF INSERTION VECTORS 
CONTAINING SEQUENCES WHICH ENCODE 
HBV SURFACE AND /OR CORE EPITOPE f S > 

In a preferred aspect of the invention for 

constructing the recombinant vaccinia viruses, the 

desired DNA sequence encoding the HBV epitope (s) is 

inserted, using recombinant DNA methodology (see 

2 5 

Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, 2d ed.. Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York) into an insertion 
(preferably, plasmid) vector flanked by (preferably) 
nonessential vaccinia DNA sequences, thus providing 
30 for subsequent transfer of its chimeric gene(s) into 
vaccinia virus by homologous recombination. The HBV 
sequences, are placed in the vector such that they can 
be expressed under the control of a promoter 
functional in vaccinia virus. 
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Expression of foreign DNA in recombinant 
vaccinia viruses requires the positioning of promoters 
functional in vaccinia so as to direct the expression 
of the protein-coding HBV DNA sequences- Plasmid 
5 insertion vectors have been constructed to insert 
chimeric genes into vaccinia virus. One type of 
plasmid insertion vector is composed of: (a) a 
vaccinia virus promoter including the transcriptional 
initiation site; (b) one dr'nore" unique restriction 

XO endonuclease cloning sites located downstream from the 
transcriptional start site for insertion of foreign 
DNA fragments; (c) nonessential vaccinia virus DNA 
(such as the thymidine kinase (TK) gene) flanking the 
promoter and cloning sites which directs insertion of 

15 the chimeric gene into the homologous nonessential 

region of the virus genome; and (d) a bacterial origin 
of replication and antibiotic resistance marker for 
replication and selection in E. coli. Examples of 
such vectors are described by Mackett (Mackett et al. , 

20 1984. J. Virol. 49:857-864). The DNA encoding HBV 
epitope (s) is inserted into a suitable restriction 
endonuclease cloning site (b, above) . Various 
derivatives of this prototype insertion vector can 
also be used, e.g., with multiple restriction sites, 

25 with the 0-galactosidase gene (letcZ) to allow 

selection of plaques by color, with inducible gene 
expression (e.g., by inclusion of a bacterial lac 
repressor system), etc. (see, e.g., Boyle et al., 
1985, Gene 35:169-177; Chakrabarti et al., 1985, Mol. 

30 Cell. Biol. 5:3403-3409; Falkner et al., 1988, J. 

Virol. 62:1849-1854; Falkner et al., 1987, Nucl. Acids 
Res. 15:7192; Franke et al. , 1985, Mol. Cell. Biol. 
5:1918-1924; Panicali et al., 1986, Gene 47:193-199; 
Patel et al- , 1988, Proc. Natl. Acad. Sci. USA 

35 85:9431-9435; Fuerst et al., 1989, Proc. Natl. Acad. 
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Sci. USA 86:2549-2553). In addition to plasmid 
insertion vectors, insertion vectors based on single- 
stranded M13 bacteriophage DNA (Wilson et al., 1986, 
Gene 49:207-213) can be used. 
5 The inserted HBV DNA should preferably not 

contain introns since intron sequences will not be 
removed during the viral life cycle, and insertion 
should preferably be so as to place the HBV coding 
sequences in close proximity to the promoter, with no 
10 other start codons in between the initiator ATG and" 
the 5' end of the transcript. 

Preferably, a plasmid insertion vector such 
as described supra, with a convenient restriction site 
for insertion of the HBV DMA, is used. Alternatively, 
15 if the complementary restriction sites used to 

fragment the HBV DNA are not present in the cloning 
vector, the ends of the DNA molecules may be modified. 
Such modifications include producing blunt ends by 
digesting back single-stranded DNA termini or by 
2 0 filling the single-stranded termini so that the ends 
can be blunt-end ligated. Alternatively, any site 
desired may be produced by ligating nucleotide 
sequences (linkers) onto the DNA termini; these 
ligated linkers may comprise specific chemically 
25 synthesized oligonucleotides encoding restriction site 
recognition sequences. Other methods known in the art 
may be used. 

The plasmid insertion vector should contain 
transcriptional and translational regulatory elements 
30 that are active in vaccinia virus. High expression 
levels in the derived recombinant virus can be 
obtained by using strong promoters or, less 
preferably, by inserting multiple copies of a single 
HBV sequence. In the use of multiple copies, care 
35 should be taken to confirm stable maintenance of the 
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copies in the vaccinia virus genome (see discussion 
supra) • 

As mentioned supra, the plasmid should be 
configured so that the HBV sequences are under the 
5 control of a promoter active in vaccinia virus. Such 
a promoter can be a native vaccinia promoter or a non- 
native promoter active in vaccinia virus. The 
promoter can be natural or synthetic. The plasmid can 
contain more than one promoter, each direct ing_ the 

10 expression of a different sequence encoding HBV 
epitope (s). In a preferred aspect, two such 
divergently oriented promoters are used. Promoters 
which can be used in the insertion vectors include but 
are not limited to the vaccinia virus thymidine kinase 

15 (TK) promoter, the 7.5K promoter (termed herein 

M p7.5 M ), a modified p7.5 (see Infra), the 11K promoter 
(termed herein M pll") , the F promoter (Paoletti et 
al., 1984, Proc. Natl. Acad. Sci- USA 81:193-197), and 
various early and late vaccinia promoters (see Moss, 

20 1990, Virology, 2d ed. , ch. 74, Fields et al., eds. , 
Raven Press, Ltd., New York, pp. 2079-2111). In the 
use of a native vaccinia virus promoter, a promoter 
active both early and late in the viral life cycle is 
preferred. 

25 In a specific embodiment, the plasmid 

insertion vector contains (for eventual transfer into 
vaccinia virus) a T7 RNA polymerase coding sequence 
under the control of a promoter active in vaccinia 
virus, and the HBV coding sequences under the control 

30 of the T7 promoter. In another specific embodiment, a 
plasmid insertion vector contains a co-expression 
system consisting of divergently oriented promoters, 
one directing transcription of -the HBV sequences, the 
other directing transcription of a reporter gene or 

35 selectable marker, to facilitate detection or 
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selection of the eventual recombinant: vaccinia virus 
(see, e.g., Fuerst et al., 1987, Mol. Cell. Biol. 
5:1918-1924). In preferred aspects, the p7.5, pll, or 
modified versions thereof are employed, some of which 
5 are described in more detail below. 

A strong vaccinia promoter, denoted here as 
M p7.5" (Cochran et al., 1985, J. Virol. 54:30-37), 
having the sequence provided in Figure 2 (SEQ ID 
NO: 2), can be one of several -vaccinia «or vaccinia-like 
10 promoters used to direct expression of the HBV 

sequences. The p7.5 promoter is so named because it 
normally directs expression of a vaccinia polypeptide 
of 7.5 kilodaltons molecular weight. The transition 
from early to late events in viral replication is 
15 generally considered to be marked by the onset of 
viral DNA replication, and p7.5 contains elements 
which are active both early and late in viral 
replication, making it act as a constitutive promoter. 
Promoter p7.5 is contained in plasmid pGS53 (Fig. 3) 
20 (Chakrabarti et al., 1985, Mol. Cell. Biol. 

5:3403-3409; kindly provided by Dr. Bernard Moss, 
National Institutes of Health). This plasmid contains 
fragments of the vaccinia thymidine kinase (TK) gene 
positioned so as to facilitate homologous 
25 recombination of the plasmid into that site in the 
vaccinia genome. 

Another vaccinia promoter, pll, in nature 
directs expression of a vaccinia structural protein of 
11 kilodaltons (European patent application 0198328, 
30 Hoffman LaRoche) . pll, unlike the p7.5 promoter, is 
active only late in viral replication. The nucleotide 
sequence of pll is shown in Figure 4 (SEQ ID NO: 3). 
As described in Example 6, infra, a number of 
constructs were made using this promoter, both alone 
35 and in combination with p7.5, to direct expression of 
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HBV antigens. The 11 kD promoter is contained in a 
plasmid called pSCIO (Fig. 5, Chakrabarti et al., 
supra; kindly provided by Dr. Bernard Moss), which 
contains the gene coding for the bacterial enzyme 
5 0-galactosidase, lacZ, located downstream of the pll 
promoter. In a specific embodiment, a pll-£-gal 
cassette is inserted into the middle of the vaccinia 
TK gene so that this expression unit can be inserted 
by in vivo recombination into the TK gene in the. 
10 virus. 

A strong vaccinia-like promoter used to 
direct expression of the hepatitis B antigens, termed 
herein "modified p7.5 w has the nucleotide sequence set 
forth in Figure 6 (SEQ ID NO: 4). The modified p7.5 

15 promoter (constructed in the laboratory of Dr. Bernard 
Moss) is a strong synthetic promoter that is active 
both early and late in viral replication. The 
modified p7.5 promoter has a sequence based partly on 
the sequence of the p7.5 promoter. The modified p7.5 

20 promoter is contained in plasmid pSC59 (Fig. 7; kindly 
provided by Dr. Bernard Moss) . 

Specific initiation signals are also 
required for efficient translation of inserted protein 
coding sequences. These signals include the ATG 

25 initiation codon and adjacent sequences. In cases 

where the entire HBV gene including its own initiation 
codon and adjacent sequences are inserted into the 
appropriate vectors, no additional translational 
control signals may be needed. However, in cases 

30 where only a portion of the gene sequence is inserted, 
exogenous translational control signals, including the 
ATG initiation codon, must be provided. The 
initiation codon must furthermore be in phase with the 
reading frame of the protein coding sequences to 

35 ensure translation of the entire insert. These 
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exogenous trans 1 at i on a 1 control signals and initiation 
codons can be of a variety of origins, both natural 
and synthetic* 



5 initiation codon of the HBV antigen or portion thereof 
is "upregulated, " i.e., contains a sequence upstream 
of that encoding the initiator methionine that matches 
a consensus sequence associated with strong ribosone 
binding. In a specific embodiment , the ATG codon of 
10 MS is upregulated. 



vector contains at least one set of HBV coding 
sequences operatively linked to a promoter, flanked by 
sequences preferably nonessential for vaccinia viral 
15 replication. Such nonessential sequences include but 
are not. limited to the TK gene (Mackett et al., 1984, 
J. Virol. 49:857-864), the vaccinia Hindlll-F DNA 
fragment (Paoletti et al., 1984, Proc. Natl. Acad. 
Sci. USA 81:193-197), the vaccinia growth factor gene 
2 0 situated within both terminal repeats (Buller et al., 
1988, J. Virol. 62:866-874), the N2 and Ml genes 
(Tamin et al., 1988, Virology 165:141-150), the Ml 
subunit of the ribonucleotide reductase gene in the 
vaccinia HindIII-I DNA fragment (Child et al., 1990, 
25 Virology 174:625-629), the vaccinia hemagglutinin 

(Shida et al., 1988, J. Virol. 62:4474-4480), vaccinia 
14 kD fusion protein gene (Rodriguez et al., 1989, 
Proc. Natl. Acad. Sci. USA 86:1287-1291), etc. (see 
also Buller and Palumbo, 1991, Microbiol. Rev. 
30 55(1) :80-122) . TK sequences are preferred for use; 

use of such sequences results in the generation of TK~ 
recombinant viruses. TK~ recombinants have been shown 
to be attenuated relative to the parent strain (Buller 
et al., 1985, Vaccines 85:163), which may be an 



In a preferred embodiment, the ATG 



As described supra, the plasmid insertion 



35 
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advantage from a safety standpoint in the development 
of new vaccinia-based vaccines. 



vaccinia recombinants containing HBV sequences within 
5 coding regions such as TK may be overly attenuated , 
the HBV-encoding sequences are inserted into a region 
of the vaccinia genome that does not encode protein, 
e*g*., an intergenic region. In addition, insertion 
into such a non-coding region in conjunction with 

10 insertion into a nonessential coding region such as TK 
might enable the generation of recombinant vaccinia 
viruses with multiple insertions which are no more 
attenuated than those viruses with a single insertion 
only in a nonessential coding region. The strategy 

15 for choosing a non-coding region is based on several 
parameters, one of which is the length of the region, 
i.e., the distance between surrounding genes. A 
larger distance is preferable to a smaller distance in 
order to avoid regulatory regions necessary for the 

2 0 expression of surrounding genes- Another parameter is 
the function of the surrounding genes, nonessential 
genes being preferable to essential genes. A sequence 
and map of the wild-type vaccinia Copenhagen strain is 
found in Goebel et al., 1990, Virology 179:247-266. 

2 5 For strains which have not been sequenced, the 

sequence of the chosen non-coding region is obtained 
or determined according to conventional nucleotide 
sequence techniques. 



3 0 construction of an insertion vector with intergenic 
flanking regions can be carried out as follows: 



candidate nonessential non-coding region for insertion 
of foreign DNA, a pair of polymerase chain reaction 
35 (PCR) primers having nucleotide sequences 



In an alternative embodiment, because 



By way of example but not limitation, 



After an intergenic region is selected as a 
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corresponding to a sequence within the chosen 
intergenic region is synthesized or obtained 
commercially. Primers are preferably chosen such that 
there is between 100 and '300 nucleotides between 
5 primers of a pair* 300 nucleotides has been 

designated herein as an upper length limit because the 
largest known non-coding region in the Copenhagen 
strain is approximately 400 nucleotides in length. In 
order to avoid potential regions of control of 

10 surrounding genes, it is preferable to leave a minimum 
of 50 nucleotides untouched on either side of the 
insertion region. Longer DNA sequences can also be 
amplified if primers hybridizing within the 
surrounding genes are used. Primers are designed so 

15 as to hybridize to complementary strands and allow 
amplification of the region between them. To 
facilitate later cloning, it is desirable to build a 
convenient restriction site into the primer sequence. 
Examples of sets of primers which may be used for the 

20 following non-coding regions, along with convenient 
restriction sites, are as follows. 



TABLE 1. 

25 



30 



35 



Non-coding 
Region 


Primers 


SEQ ID NO: 


SITE 


FI4L-FI5L 


5 ' GTTG GTA G A ATTCC AATTAT3 ' 
5'AAGAATGA1CGAIACAGTTT3' 


50 
51 


EcoRI 
Clal 


C12L-C11R 


5'GTACCCCGAATTCATACTTA3' 
5 ' AATA ATATATCG ATA ATTGT3 ' 


52 
53 


EcoRI 
Clal 


A53R-A55R 


5 / GTTGGAATTCCGCTACTGAT3' 
5 TAACC AAGTATCG ATATAAT3 ' 


54 
55 


EcoRI 
Clal 
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Viral DNA is isolated from the wild type, 
e.g., Wyeth virus according to known procedures - The 
primers are then hybridized to the viral DNA under 
non-denaturing conditions. PCR (see Section 5.3) is 
5 then performed on the isolated DNA using the pair of 
primers corresponding to the chosen intergenic region. 
The amplified DNA is then digested with restriction 
enzymes which cut within the primer sequences and the 
resulting fragment is gel purified. The gel purified 
10 DNA is cloned into an appropriate vector using 
standard cloning techniques. 

The cloned non-coding region is digested 
. with a restriction enzyme at a site within the non- 
coding region. Noncoding regions which can 
15 potentially be used, with their approximate size and a 
convenient restriction site for insertion, are set 
forth in Table 2. 



TABLE 2 



25 



Non-coding 
Region 


Approximate Band Size of 
Non-coding Region 


Internal 
Restriction Site 


F14L-F15L 


210 bp 


SnaBI 


C12L-C11R 


260 bp 


Bell 


A53R-A55R 


240 bp 


SnaBI 



In order to clone the desired HBV sequences downstream 
of vaccinia promotors in the vector, the desired 
promoter/HBV DNA construct is inserted within the 
cloned non-coding region using the appropriate 
restriction enzymes and sites. Preferably, the HBV 
sequences are inserted so as to be flanked by at least 
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about: 100 bp of the noncoding vaccinia DNA. Most 
preferably the sequences are inserted into the vector 
so as to be surrounded by about 500 bp of vaccinia DNA 
on each side, which may include some DNA from 
5 nonessential vaccinia genes. 

5.3. CONSTRUCTION AND IDENTIFICATION OF 
RECOMBINANT VACCINIA VIRUSES 
ENCODING HBV EPITOPE (SI 



10 



15 



20 



25 



30 



35 



Recombinant 'vaccinia viruses are preferably 
produced by transfection of the recombinant insertion 
vectors containing the HBV sequences into cells 
previously infected with vaccinia virus. 
Alternatively, transfection can take place prior to 
infection with vaccinia virus. Homologous 
recombination takes place within the infected cells 
and results in the insertion of the foreign gene into 
the viral genome, in the region corresponding to the 
insertion vector flanking regions. The infected cells 
can be screened using a variety of procedures such as 
immunological techniques, DNA plaque hybridization, or 
genetic selection for recombinant viruses which 
subsequently can be isolated. These vaccinia 
recombinants preferably retain their essential 
functions and infectivity and can be constructed to 
accommodate up to approximately 35 kilobases of 
foreign DNA. 

Transf ections may be performed by procedures 
known in the art, for example, a calcium 
chloride-mediated procedure (Mackett et al., 1985, The 
construction and characterization of vaccinia virus 
recombinants expressing foreign genes, in DNA Cloning, 
Vol. II, Rickwood and Haines (eds.)# IRL Press, 
Oxford-Washington, D.C.) or a liposome-mediated 
procedure (Rose et al., 1991, Biotechniques 
10:520-525) . 
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Where, as is preferred, flanking TK 
sequences are used to promote homologous 
recombination, the resulting recombinant viruses thus 
have a disrupted TK region, permitting them to grow on 
5 a TK" host cell line such as Rat2 (ATCC Accession No. 
CRL 1764) in the presence of 5-bromo-2 • -deoxyuridine 
(BUDR) , under which conditions non-recombinant (TK + ) 
viruses will not grow* TK~ recombinants have been 
shown to be attenuated relative to the parent strain 

10 (Buller et al., 1985, Infectious vaccinia virus TK~ 
recombinants that express foreign genes are less 
virulent than wild-type virus in mice, in Vaccines 85, 
Molecular and chemical Basis of Resistance to 
Parasitic, Bacterial and Viral Disease, R.A. Lerner, 

15 chanock, R.M. , and Brown, F- (eds.), Cold Spring 

Harbor Laboratory, New York, pp. 163-167) , which may 
be an advantage from a safety standpoint in the 
development of new vaccinia-based vaccines. 

In another embodiment, recombinant vaccinia 

20 viruses of the invention are made by in vitro cloning, 
and then packaging with a poxvirus sensitive to a 
selection condition, rather than by homologous 
recombination. For example, the HBV DNA sequences can 
be inserted into vaccinia genomic DNA using standard 

2 5 recombinant DNA techniques in vitro; this recombinant 
DNA can then be packaged in the presence of a "helper" 
poxvirus such as a temperature sensitive vaccinia 
virus mutant or a fowlpox virus which can be selected 
against under the appropriate conditions. 

30 Various vaccinia virus strains known in the 

art can be used to generate the recombinant viruses of 
the invention. Characterized strains of low virulence 
are preferred. A preferred vaccinia virus is the New 
York City Department of Health Laboratories strain, 

35 prepared by Wyeth (available from the American Type 
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Culture Collection (ATCC) , Accession No. VR-325) . 
Other vaccinia strains include but are not limited to 
the Elstree and Moscow strains, the strain of Rivers 
(CV-i and CV-2), and the LC16m8 strain of Hashizume. 
5 Selection of the recombinant vaccinia virus 

can be by any method known in the art, including 
hybridization techniques (e.g., using HBV DNA 
sequences as a hybridization probe) , immunological 
techniques (e.g., assay for binding to antibodies 

10 recognizing the encoded HBV epitope (s) ) , etc* In a 
preferred aspect where TK flanking sequences are used 
in the insertion vector, selection is for TK~ 
recombinants, as described above; screening for the 
correct recombinant is then carried out by molecular 

15 analyses as described Infra. In many preferred 
aspects, the method of choice for selection is 
dictated by the choice of insertion vector used to 
generate the recombinant viruses. For example, if a 
coexpression system vector is used, containing a 

20 reporter gene or selectable marker, detection of 
recombinants is carried out by screening for the 
reporter gene expression or selecting for selectable 
marker expression. In an embodiment where the lacZ 
gene is incorporated, color screening methods known in 

25 the order can be used to detect expression of 

0-galactosidase. Incorporation of neomycin or gpt 
genes allows selection by antibiotic resistance. 
Plaque size can also be used for screening. Many 
other methods will be known to the skilled artisan and 

3 0 can be used . 

The selected recombinant vaccinia virus is 
then generally plaque-purified (preferably by at least 
three rounds of purification frpro a single viral 
plaque) , and subjected to molecular analyses to verify 

35 its identity and purity. Preferably, two types of 
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molecular analyses are carried out: (i) nucleic acid 
analyses; and (ii) assays for expression of the 
encoded HBV epitope (s) . 

For nucleic acid analyses, to verify that 
5 the recombinant vaccinia virus contains the DNA 
. sequences encoding HBV epitope (s) , hybridization 
assays are preferably carried out using a labelled 
nucleic acid containing HBV sequences as a 
hybridization probe. For example, plaque-lift 
10 hybridizations (see Mackett et al., 1985, in DNA 
Cloning, Vol. II, Rickwood and Haines (eds.) / IRL 
Press , Oxford-Washington , D . C • ) , Southern 
hybridizations (Southern, 1975, J. Mol. Biol. 98:503- 
517; see Section 8.1. Infra) , and Northern 

15 hybridizations (Freeman et al. , 1983, Proc. Natl. 
Acad. Sci. USA 80:4094-4098) can be carried out. 
Other methods of nucleic acid analysis which can be 
employed are restriction endonuclease mapping 
(Sambrook et al., 1991, Molecular Cloning, A 

2 0 Laboratory Manual, 2d ed. , Cold Spring Harbor 

Laboratory Press, New York), and DNA sequence analysis 
(e.g., according to Maxam and Gilbert, 1980, Meth. 
Enzymol. 65:499-560; Sanger et al., 1977, Proc. Natl. 
Acad. Sci. USA 74:5463; or Tabor and Richardson, U.S. 

25 Patent No. 4,795,699; or by use of an automated DNA 
sequenator) . If desired, polymerase chain reaction 
(PCR; U.S. Patent Nos. 4,683,202, 4,683,195 and 
4,889,818; Gyllenstein et al., 1988, Proc. Natl. Acad. 
Sci. USA 85:7652-7656; Ochman et al., 1988, Genetics 

30 120:621-623; Loh et al., 1989, Science 243:217-220) 
can be done prior to sequencing in order to amplify 
the region of interest. 

Southern blot hybridisation can also be 
carried out using a hybridization probe containing the 

35 desired vaccinia sequences to ensure that the HBV 
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sequences have been inserted into the desired region 
of the vaccinia genome* 

Protein analyses, to confirm expression of 
the encoded HBV epitope (s) by the selected recombinant 
5 vaccinia virus, can be carried out by any method known 
in the art, including functional or immunological 
assays, and are preferably accomplished by immunoassay 
methods employing antibodies to the encoded HBV 
epitope (s). Various immunoassays known in the art can 
10 be used, including but not limited to competitive arid 
non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent 
assay), M sandwich M immunoassays, immunoradiometric 
assays, gel diffusion precipitin reactions, 
15 immunodiffusion assays, in situ immunoassays (using 
colloidal gold, enzyme or radioisotope labels, for 
example) , western blots , immunoprecipitation 
reactions, agglutination assays (e.gr., gel 
agglutination assays, hemagglutination assays) , 
20 complement fixation assays, immunofluorescence assays, 
protein A assays, and Immunoelectrophoresis assays, 
etc. In one embodiment, antibody binding is detected 
by detecting a label on the primary antibody. In 
another embodiment, the primary antibody is detected 
25 by detecting binding of a secondary antibody or 
reagent to the primary antibody. In a further 
embodiment, the secondary antibody is labelled. Many 
means are known in the art for detecting binding in an 
immunoassay and are envisioned for use. 
30 In specific embodiments, western blot 

analyses can be carried out as described in Section 
8.2. infra* ELISA can be carried out as described in 
Section 8.3 infra. Several immunoassays for HBV s or 
e antigens are commercially available (Abbott 
35 Laboratories, No. Chicago, IL) . 



17:57:02 



WO 94/12617 



PCT/US93/11474 



Recombinant viruses containing a combination 
of epitopes from core and surface elements are a 
preferred embodiment of the invention and can be 
generated using the plasmids described in the examples 
5 sections infra, and homologous recombination. 

5.4. DETERMINATION OF VACCINE EFFICACY 

Immunopotency of the HBV epitope (s) in their 
live vaccinia vaccine formulation can be determined by 
10 monitoring the immune response of test animals 

following immunization with the recombinant vaccinia 
virus (es) expressing the HBV epitope (s). Generation 
of a humoral response may be taken as an indication of 
a generalized immune response, other components of 
15 which, particularly cell -mediated immunity, may be 
important for protection against HBV. Test animals 
may include mice, rabbits, chimpanzees and eventually 
human subjects. HBV can be made to infect chimpanzees 
experimentally, although they do not develop chronic 

2 0 infections. However, since chimpanzees are a 

protected species, the antibody response to an HBV 
vaccine of the invention can first be studied in a 
number of smaller, less expensive animals, with the 
goal of finding one or two best candidate viruses or 
25 best combinations of viruses to use in chimpanzee 
efficacy studies. 

Methods of introduction of the vaccine may 
include oral, intracerebral, intradermal, 
intramuscular, intraperitoneal , intravenous, 

3 0 subcutaneous, intranasal or any other standard routes 

of immunization. The immune response of the test 
subjects can be analyzed by various approaches such 
as: the reactivity of the resultant immune serum to 
HBV antigens, as assayed by known techniques, e.g., 
35 enzyme linked immunosorbent assay (ELISA) , 
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immunoblots , radioimmunoprecipitations , etc . ; or 
protection from HBV infection and/or attenuation of 
hepatitis symptoms in immunized hosts. 

In a specific embodiment, recombinant 
5 vaccinia virus vaccines of the invention can be 

prepared in large quantities for animal studies. By 
way of example, this is done by infecting roller 
bottles containing 10* to 10* cells with virus, and 
harvesting after several , days of infection. The cells 
10 are pelleted, washed, and then disrupted by 

resuspension and homogenization in a hypotonic buffer. 
Nuclei and cellular debris are then pelleted and the 
virus-containing supernatant is sonicated to break up 
any virus aggregates. The viruses are then pelleted 
15 through a cushion of 36% sucrose in a hard spin, 

followed by banding in a 24-40% sucrose gradient. The 
purified virus is then harvested by a hard spin, 
resuspended in a small volume, and titrated to 
determine viral concentration and yield. This process 
SO may be scaled up using a tangential flow filtration 
apparatus, and a continuous flow homogenizer. 

As one example of suitable animal testing, 
HBV vaccines of the invention may be tested in rabbits 
for the ability to induce an antibody response to HBV 
!5 antigens. Hale specific-pathogen-free (SPF) young 

adult New Zealand White rabbits may be used. The test 
group of rabbits each" receives approximately 5 x 10* 
pfu (plague forming units) of the vaccine. A control 
group of rabbits receives an injection in 1 raM 
0 Tris-HCl pH 9.0 of the parental non-recombinant 

vaccinia virus or of a recombinant virus which does 
not express HBV antigens. 

Blood samples may be drawn from the rabbits 
every one or two weeks, and serum analyzed for 
5 antibodies to the HBV antigens using, e.g., a 
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radioimmunoassay (Abbott Laboratories) . The presence 
of antibodies specific for core protein or the surface 
antigens may be assayed using an EL. ISA. The sera may 
also be analyzed for antibodies to vaccinia, e.g., in 
5 an enzyme- linked immunoassay. Because rabbits may 
give a variable response due to their outbred nature, 
it may also be useful to test the vaccines in mice. 

Mice respond differently to HBV antigens 
depending on their H-2 (histocompatibility) type 

10 (Milich and Chisari, 1982, J. Immunol- 129:320-325). 
Because there is little information available 
regarding the expression of HBV antigens from 
recombinant vaccinia viruses in mice , an HBV— 
responsive strain of mouse must be chosen, and an 

IS appropriate dose of vaccinia virus must be 

administered to render the mouse strain responsive to 
recombinant viruses of the invention (see Section 8.4, 
infra) . 

Patas monkeys may be used to test for 
2 0 immunogenicity of HBV vaccine formulation, although 

challenge experiments cannot be carried out since they 
are resistant to HBV infection. Chimpanzees may be 
tested for HBV vaccine efficacy (e.g., challenge 
experiments) , since they can be infected by the virus. 
25 in a specific embodiment, monkeys each receive 

intradermally approximately 5 x 10* pfu of recombinant 
Wyeth virus. A control monkey receives Wyeth 
(control) virus intradermally. Blood is drawn weekly 
for 12 weeks, and serum is analyzed for antibodies to 
30 vaccinia, core, and S. 

5.5. VACCINE FORMULATION AMD ADMINISTRATION 
The purpose of this embodiment of the 
invention is to formulate a vaccine in which the 
35 iramunogen is one or several recombinant vaccinia 
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virus (es) that express (es) HBV epitope (s) so as to 
elicit a protective immune (humoral and/or cell 
mediated) response against HBV infections for the 
prevention of hepatitis, hepatoma and/or other 
5 undesirable correlates of HBV infection. 

Many methods may be used to introduce the 
vaccine formulations of the invention; these include 
but are not limited to oral, intradermal, 
intramuscular , intraperitoneal , intravenous , 
10 subcutaneous, intranasal routes, and via scarification 
(scratching through the top layers of skin, e.g., 
using a bifurcated needle) . In a preferred 
embodiment, scarification is employed; intravenous 
administration is disfavored, since it is preferred to 
15 have the virus replicate in the skin and not spread 
systemically. 

The vaccine formulations of the invention 
comprise the recombinant vaccinia virus and a 
pharmaceutical ly acceptable carrier or excipient. The 
20 recombinant virus can be replicating or 

nonreplicating, although replicating is preferred. 
Similarly, an enveloped virus is preferred over the 
non-enveloped form. Pharmaceutically acceptable 
carriers include but are not limited to saline, 
25 buffered saline, dextrose, water, glycerol, sterile 
isotonic aqueous buffer, and combinations thereof. 
The formulation should suit the mode of 
administration. 

The composition, if desired, can also 
50 contain minor amounts of wetting or emulsifying 

agents, or pH buffering agents. The composition can 
be a liquid solution, suspension, emulsion, tablet, 
pill, capsule, sustained release formulation, or 
powder. Oral formulation can include standard 
*5 carriers such as pharmaceutical grades of mannitol, 
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lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, etc. 

Generally, the ingredients are supplied 
either separately or mixed together in unit dosage 
5 form, for example, as a dry lyophilized powder or 
water free concentrate in a hermetically sealed 
container such as an ampoule or sachette indicating 
the quantity of active agent. Where the composition 
is administered by injection, an ampoule of sterile 
10 diluent can be provided so that the ingredients may "be 
mixed prior to administration* 

In a specific embodiment, the lyophilized 
recombinant vaccinia virus of the invention is 
provided in a first container; a second container 
i5 comprises diluent consisting of an aqueous solution of 
50% glycerin, 0.25% phenol, and an antiseptic (e.g., 
0.005% brilliant green). 

The precise dose of virus to be employed in 
the formulation will also depend on the route of 
2 0 administration, and the nature of the patient, and 
should be decided according to the judgment of the 
practitioner and each patient's circumstances 
according to standard clinical techniques. However, a 
suitable dosage range for administration via 
25 scarification with a bifurcated needle is 10 6 to io 9 
pfu/ml (see, e.g., Cooney et al., 1991, The Lancet 
337:567-572, regarding recombinant vaccinia viruses 
expressing HIV proteins) . Boosting is possible but 
not preferred. If boosting is desired, it may be 
30 preferable to boost with the HBV antigen in purified 
form rather than using a recombinant virus of the 
invention, since long-lasting immunity to vaccinia 
virus may limit the utility of recombinant vaccinia 
virus for repetitive boosting (see id.). Effective 
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doses may also be extrapolated from dose-response 
curves derived from animal model test systems. 

The invention also provides a pharmaceutical 
pack or kit comprising one or more containers filled 
5 with one or more of the ingredients of the vaccine 
formulations of the invention. Associated with such 
container (s) can be a notice in the form prescribed by 
a governmental agency regulating the manufacture, use 
or sale of pharmaceuticals or biological products, 
10 which notice reflects approval by the agency of 

manufacture, use or sale for human administration. 



5.6. USE OF ANTI-HBV ANTIBODIES 
GENERATED BY THE VACCINES 
OF THE INVENTION 



15 



20 



25 



30 



35 



The antibodies generated against HBV by 
immunization with the recombinant viruses of the 
present invention also have potential uses in 
diagnostic immunoassays, passive immunotherapy, and 
generation of antiidiotypic antibodies. 

The generated antibodies may be isolated by 
standard techniques known in the art (e.g., 
immunoaf f inity chromatography, centrifugation, 
precipitation, etc.) and used in diagnostic 
immunoassays. The antibodies may also be used to 
monitor treatment and/or disease progression. Any 
immunoassay system known in the art, such as those 
listed supra, may be used for this purpose including 
but not limited to competitive and noncompetitive 
assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme-linked immunosorbent 
assays) , "sandwich" immunoassays, precipitin 
reactions, gel diffusion precipitin reactions, 
immunodiffusion assays, agglutination assays, 
complement-fixation assays, immunoradiometric assays, 
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fluorescent immunoassays, protein A immunoassays and 
immunoelectrophoresis assays, to name but a few. 

The vaccine formulations of the present 
invention can also be used to produce antibodies for 
5 use in passive immunotherapy, in which short-term 

protection of a host is achieved by the administration 
of pre-fonned antibody directed against a heterologous 
organism (in this case, HBV) . 

The antibodies generated by *the vaccine 

10 formulations of the present invention can also be used 
in the production of antiidiotypic antibody. The 
antiidiotypic antibody can then in turn be used for 
immunization, in order to produce a subpopulation of 
antibodies that bind the initial antigen of the 

15 pathogenic microorganism (Jerne, 1974, Ann. Immunol. 

(Paris) 125c:373; Jerne, et al., 1982, EMBO J. 1:234). 

5.7. THERAPEUTIC USE OF THE RECOMBINANT 
VACCINIA VIRUSES OF THE INVENTION 

20 In this embodiment, the recombinant vaccinia 

viruses of the invention, encoding HBV immunogenic 
epitope (s), are administered therapeutically, for the 
treatment of hepatitis or other undesirable correlates 
of HBV infection. Administration of such viruses, 

2 5 e.g. , to neonates and other human subjects, can be 
used as a method of immunostimulation , to boost the 
host^ immune system, enhancing cell-mediated and/or 
humoral immunity, and facilitating the clearance of 
HBV. The viruses of the invention can be administered 

30 alone or in combination with other anti-viral 

therapies, including but not limited to or-interf eron 
and vidarabine phosphate - 



35 
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6. CONSTRUCTION OP VECTORS 
ENCODING HBV EPITOPES 

Some of -the' vectors which were constructed 

to contain DNA sequences encoding HBV surface antigen 

and/or core epitopes, under the control of a promoter 

functional in vaccinia virus, are listed in Table 3, 

along with their encoded antigens and the promoters 

operably linked to the HBV DNA. These vectors and 

their construction are described in the subsections 

below . 



TABLE 3 



Plasmids Containing Sequences Encoding 
HBV Epitopes Under the Control of a 
Promoter Functional in Vaccinia Virus 



25 



30 







PROMOTER 




PLASMID 


p7.5 


pll 


modified pJJS 


pSCIO 




0-galftctostdase 




pGS53/S 


S 






pGS53/S2 


MS 






pGS53/SI 


LS 






pHTL-5 




coie(ftitl-lengtb) 




pHTL-6 




core(full-length) 


S 


pHTL-7 




core(fuU-length) 


LS 


pHTL-8 






S 


pHTL-9 






LS 


pHTL-10 






MS 


pHTL-IOM 






MS (upreg. ATG) 


pHTL-11 




c©re(fu II - length) 


MS 


pHTL-12 




core(A8) 


MS 


pHTL-13 


core(fuH-length) 


S 
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lO 



15 



25 



30 



PLASMID 


PROMOTER 


p7.5 


pi 1 


modified p/.D 


pHTL-14 


preS l-core{AB) 




MS 


pHTL-15 


core(fuu*leogtn) 




S 


pHTL-17 


core(fu U-leo gth) 




MS 


nTTTT - 1 ft 


preS 1 -core(full-I ength) 




MS 


„uTf *y\ 




preSl-core(A8) 


1 


pjlll^ZA 




preS 1 -coreCfuU-leogth) 




ptl 1 






coretfuU-length) 








cor»-preSl * 


pn i l^z. i 






core~preS2 


pn i L*r^.o 






core-S* 


oHTI -30 


core(A8) 




MS (upreg. ATG> 


pHTL-31 


preS 1 -core(A8) 




MS (upreg. ATG) 


pHTL-32 


prco i -core\Aoj 






pHTL-33 


coi^juii-ienginj 




Mo (upreg. AlO) 


pHTL-34 


core(full*lerjgtb) 




c 
o 


pHTLOS 


core(fuII-length) 




LS 


pHLT-36 


preS 1 -coreCftiH-length) 




MS (upreg. ATG) 


pRO-10 




S 




pRO-II 


MS 


S 




P RO-13B 


LS 






pRO-16 


core(A8) 


S 




pRO-16M 


core(A8) 






pRO-17 


MS 


core(A8) 




pRO-18 


LS 


core(A8) 




pRO-19 




core(A8) 
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PLASM ID 


PROMOTER 




pll 


modified p7.5 


pRO-19/24 




core(ftiU-len£tb) 




pRO-21 


MS 


preS]-core(A8) 




pKO-22 


MS 


core{fuU-length) 




pRO-23 


MS 


preS 1 -coreCfult-length) 





Other vectors which were constructed , and 
which have utility in generating the recombinant 
viruses of the invention upon further manipulation 
(with respect to the cloning vectors) , or with a 
plasmid that provides T7 polymerase (with respect to 
HBV sequences operably linked to a T7 promoter) are 
also described Infra . 



6.1. SUBCLONING OF HBV GENES 
The HBV genome, showing the core and surface 
antigen-encoding regions, is illustrated schematically 
in Figure 8. Using standard recombinant DNA 
techniques (see, for example, Sambrook et al., 1989, 
Molecular Cloning, A Laboratory Manual, 2nd ed., Cold 
Spring Harbor Press, New York), each of the four HBV 
genes were subcloned separately from the cloned HBV 
genome and placed in individual plasmids such that 
multiple restriction enzyme digestion sites surrounded 
each gene and provided for easy manipulation. A 
method used to make these subclones is described 
below. Unless otherwise mentioned, all DNA-modifying 
enzymes were obtained commercially from New England 
Biolabs (Beverly, MA) or BRL Life Technologies, Inc. 
(Gaithersburg, MD) , and were used according to the 
manufacturer 1 s instructions . 
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6.1.1. SURFACE ANTIGENS 
The HBV genome (subtype adw) is contained in 
plasiuid pAM6 (Moriarty et al., 1981, Proc. Natl. Acad. 
Sci. USA 78:2606-2610, available from the American 
5 Type Culture Collection (ATCC) Accession No. 45020, 
Fig. 9} , a pBR322-based vector that is replicable in 
E+ coll. Since the MS/LS open reading frames (ORFs) 
in pAM6 are interrupted by pBR322 DNA, these ORFs were 
resected using the 2883 bp cloning vector pT7T318 

10 (Pharmacia cat. no. 27-3512; Fig. 10), which contains 
multiple restriction sites. 

The 5 • end of the open reading frame coding 
for the three surface antigens, which is bounded by 
BamHI and Bglll restriction endonuclease sites in 

15 pAM6, a fragment of length 799 bp, was removed from 
pAM6 and inserted into BamHI-linearized pT7T318 to 
generate plasmid pRO-01 (Fig- 11) . Briefly, pAM6 
plasmid DNA was double digested with BamHI and Bglll 
enzymes, and the fragments generated by the digestion 

2 0 were electrophoretically separated on an 0.8% agarose 
gel with appropriate size markers in IX TBE (89.2 mM 
Tris pH 8.0, 89 mM boric acid, 2 mM EDTA) , and the 
fragment corresponding to the 799 bp insert containing 
preSl and part of preS2 was excised from the 

25 ethidium-bromide stained gel. This fragment was 

eluted from the gel by electrophoresis into a dialysis- 
bag in 0.5X TBE and ligated, using T4 DNA ligase, to 
the pT7T318 DNA which had been linearized by BamHI 
digestion, similarly gel-purified, and treated with 

30 bacterial alkaline phosphatase (BAP) to prevent 

recircularization of the vector. A fraction of the 
ligation mix was transformed into E~ coll (strain 
DH5oc F'I Q , available from BRL Life Technologies, Inc., 
cat. no. 8288SA) which had previously been made 

35 competent to take up the DNA by standard techniques 
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(Sambrook et al., supra)* The transformed cells were 
plated on plates of LB agar plus 50 vg/ml ampicillin 
to select for cells which had been transformed with 
the plasmid. Small 1 ml cultures of LB broth (10 g/1 
5 bactotryptone, 5 g/1 bacto-yeast extract, 10 g/1 NaCl 
pH 7.5) plus 50 /ig/ml ampicillin were then grown. DNA 
was prepared from these cultures by a boiling lysis 
miniprep protocol (Sambrook et al,, supra) followed by 
digestion and electrophoresis on an agarose gel to 

10 screen for a colony containing an insert of the 

expected size and orientation. Specifically, the size 
and desired orientation of the insert was verified by 
digestion with PstI, which cuts in the polylinker of 
PT7T318 and within the insert, 5 bp from the 3 • end of 

15 the preS2 region. In the desired orientation, 

digestion of pRO-0l with PstI yields fragments of 
812 bp and 2870 bp, consistent with the PstI site of 
the pT7T318 polylinker being at the N-^terminus of the 
open reading frame. Large quantities of the desired 

20 DNA and a glycerol stock of the plasmid-harboring E~ 
coll were prepared by standard methods from a 250 ml 
LB broth plus ampicillin culture of the chosen colony. 



surface antigens was manipulated using similar 
25 techniques to connect it to the 5* end of the ORF. 

Specifically, the 3 '-end fragment with length 1372 bp 
wa excised from pAM6 by BamHI digestion and purified 
from an agarose gel. This fragment was inserted into 
BamHI-linearized and bovine alkaline phosphatase 
30 (BAP) -dephosphorylated pRO-01, a fragment of 3683 bp, 
to generate pRO-02 (Fig. 12), which contains an intact 
open reading frame capable of encoding LS, MS and S 
. antigens. The correct orientation of the insert in 
pRO-02 was verified by double digestion with PstI and 



The 3 • end of the ORF coding for the three 



35 
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Xbal to yield a restriction pattern consistent with 
fragments of sizes 4018 , 800, 224 and 12 bp. 

In order to achieve a high level of 
expression of the genes encoding the LS, MS, and S 
5 antigens, each gene was engineered so as to be under 
the transcriptional control of a vaccinia or 
vaccinia-like promoter, i.e., a promoter which is 
recognized by vaccinia virus KNA polymerase. in 
nature,- the 3 f end of the vaccinia promoter is often 
10 very close to, in the case of early promoters, or 
superimposed on, in the case of late promoters, the 
translation initiation codon of the ORF (1990, 
Virology, Fields, B.N. , ed. , Raven Press, New York), 
so we deemed this proximity also desirable in 
15 recombinant constructs. With this goal in mind, the 
individual s, MS, or LS ORFs were subcloned back into 
pT7T318 to place restriction endonuclease sites close 
to the initiating ATG of each ORF. Using these sites, 
the vaccinia or vaccinia-like promoters were placed 
2 0 very close to the initiating ATG of each ORF. 

The trans lational initiation codons of the 
MS and S ORFs are contained within a 321 bp Hindi 
fragment of pRO-02. This Hindi fragment was isolated 
from pRO-02 and cloned into Hincll-linearized and 
25 BAP-treated pT7T318 to generate pRO-03 (Fig. 13). The 
correct orientation of the inserted fragment was 
verified by digestion of pRO-03 with BamHI to yield a 
restriction pattern consistent with fragments of 201 
bp and 3003 bp, indicating that the insert had been 
30 made such that the BamHI site in the pT7T318 

poly linker was located at the 3» end of the insert. 

From pRO-03, the initiating ATG of the S ORF 
was subcloned so as to be in close proximity to useful 
restriction sites. Specifically, pRO-03 was digested 
35 with Nlalll, which digests the plasmid in many places 
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(11 sites) including within the MS-encoding region one 
base pair upstream of the initiating ATG of the S ORF. 
N lain -digested pRO-03 was also digested with Hindi, 
which cuts the plasmid at the 3 • end of the S-f ragment 
5 insert. The 67 bp ATG-containing fragment generated 
by. this Nlalll/HincII digestion was then isolated from 
a 5.5% polyacrylamide gel run with appropriate size 
standards and then cloned into the vector backbone of 
PT7T318 which had been double digested with SphX and 

10 Hindi, generating pRO-05 (Fig. 14) , which contains 
the initiating ATG of the S ORF. The presence of the 
insert in pRO-05 was verified by double digestion with 
Hindlll and EcoRI to yield a restriction pattern 
consistent with fragments of lengths 103 and 2871 bp. 

IS The orientation of the insert in pRO-05 was verified 
by restriction digestion to be such that the Hindi 
site at the carboxyl- terminus of the inserted ORF is 
retained. 

In a similar manner, the initiating ATG of 
2 0 MS was subcloned so as to be closely surrounded by 

restriction sites. Specifically, pRO-03 was digested 
with Haelll, which cuts pRO-03 at 12 sites including 
one just two base pairs 5» of the initiating ATG of 
MS, together with Hindi. The resulting 232 bp 
25 fragment containing the 5 1 end of the MS ORF was 
isolated and subcloned into the 2866 bp vector 
backbone of pT7T318 which had been double digested by 
Smal and Hindi, BAP-treated, and gel-purified, 
generating pRO-06 (Fig. 15) . The size and orientation 
30 of the insert were verified by double digestion with 
Hindi and EcoRI to yield a restriction pattern 
consistent with fragments of length 248 bp and 2850 
bp. 

The . 3 • end of the s antigen coding region 
35 was then added to the S ORF in pRO-05 by Hindi and 
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StuI double digestion of pRO-02 and isolation of the 
750 bp fragment coding for this C-terminal region - 
This fragment was inserted into the 2918 bp Hindi and 
Smal double digested vector fragment of pRO-05 to 
5 generate pT7T3/S (Fig. 16). Restriction analysis was 
used to confirm that the orientation of the insert in 
pT7T3/S was such that the Hindi site within the S ORF 
had been recreated. pT7T3/S thus contains DNA 
encoding the entire open reading frame of the S 

10 antigen, beginning with the initiating ATG. 

In a similar manner, the 3 • end of the S 
antigen was added to the MS ORF in pRO-06 by double 
digesting pRO-02 with Hindi and SphI, isolating the 
fragment corresponding to the 1016 bp 3' end, and 

15 inserting this fragment into the 3085 bp vector 

fragment of Hindi and SphI double digested pRO-06, 
generating pT7T3/S2 (Fig. 17). Restriction analysis 
was used to verify that the insert's orientation in 
pT7T3/S2 was such that the Hindi site within the S 

20 region had been recreated. pT7T3/S2 created in this 
manner contains DNA encoding the entire open reading 
frame of the MS antigen, beginning with the initiating 
ATG of MS. 

A plasmid construct containing the I*S open 
25 reading frame closely surrounded by restriction 

endonuclease sites was constructed using a similar 
procedure. The 509 bp Hindi fragment of pAM6 
containing the LS ATG was subcloned into 
Hindi-linearized pT7T318, generating pLEH-01 (Fig. 
30 18). pLEH-01 was then double digested with Nlalll, 

which cuts 10 times including at a site 1 base pair 5" 
of the initiating ATG of LS , and with Hindi at a site 
approximately 230 base pairs 3'* of the ATG. This 
fragment of 234 bp, containing the 5' end of the LS 
35 ORF, was subcloned into the 2869 bp vector backbone of 
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pT7T318 which had been double digested with SphI and 
Hindi to give pLEH-02 (Fig- 19). Restriction 
analysis was used to confirm the size of the insert in 
pLEH-02 and that the insert was oriented so as to 
5 recreate the Hindi site. The 132 bp middle fragment 
of the I>S ORP stretching from within preSl to within 
preS2 was then subcloned from HincII/BamHI-cut pRO-02, 
into the 3091 bp vector backbone of pLEH-02 which had 
been double digested with Hindi and BamHI , generating 
10 pLEH-03 (Fig. 20) . Restriction digestion was used to 
verify that the insert in pLEH-03 was oriented so as 
to restore both the Hindi and BamHI sites, and that 
an insert of the expected size had been made. 
Finally, the 938 bp portion of the LS ORF coding for 
15 the C- terminus of I*S was subcloned from pRO-02 which 
had been BamHI / StuI —digested , into the 3218 bp vector 
backbone of BamHI/Smal-digested pI#EH-03, generating 
pLEH-04 (Fig. 21) . . Restriction digests were used to 
confirm the size of the insert in pLEH-04 , and that 
20 the insert had been made so as to reconstruct the 
BamHI site within the preS2 region. 

To verify the DNA sequences of the subcloned 
S, MS, and I*S regions, the dideoxy method (e.g., 
Seguenase kit, United States Biochemical Corp. ; Tabor 
25 and Richardson, 1989, J. Biol. Chem. 214:6447-6458) 

was used to sequence double stranded DNA from pT7T3/S, 
PT7T3/S2, and pI*EH-04 inwards from primers (M13 Single 
Strand Primer, Pharmacia LKB cat. no. 27-1534-03, and 
M13 Reverse Sequence Primer, Cat no. 27-1532-01) 
30 complementary to both ends of the polylinker. In 

addition, a primer hybridizing just 3* of the ATG of 
the s ORF was used to facilitate sequencing of the 5 1 
ends of the MS- and LS-encoding sequences. In order 
to sequence the interior of the S coding region, a 530 
35 bp internal Ball/Dral fragment of the S region was 
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subcloned into pT7T318 which had been linearized with 
Hindi. Sequence data generated using this strategy 
was analyzed in comparison to the published sequence 
of one isolate of subtype adw DNA (Ono et al. , 1983, 
5 Nucl. Acids Res. 11:1747-1757). When we performed 
these experiments , our sequence data showed three 
translatable open reading frames corresponding to DNA 
encoding LS, MS and S. The nucleotide and deduced 
amino acid sequences of the I*S (SEQ ID NO: 5 and 

10 NO: 6), MS (SEQ ID NO: 7 and NO: 8), and S (SEQ ID 

NO: 9 and NO: 10) regions are shown in Figures 22, 23, 
and 24, respectively. Changes in nucleotide and amino 
acid sequence from the published sequence are noted in 
Table 4 and are believed to result from variability 

15 within the adw subtype. 

TABLE 4 

Comparison of Published HBV Surface ORFs (subtype adw)* 
and Surface ORFs in pLEH-04 

20 



25 



30 



Base of Change 
Relative to A of ATG 


Region of 
Change 


Published 
Triplet 


PubBshed 
Amino Add 


pLEH-04 
Triplet 


pLEH-04 
Amino Acid 


27 


preSI 


A A C 


Asn 


A AT 


Asn 


40 


preSI 


CTT 


Leu 


TTT 


Phe 


83. 84 


preSI 


AC A 


Thr 


A A C 


Asn 


166 


preSI 


CTC 


Leu 


TTC 


Phe 


187 


pre SI - 


ATT 


He 


G TT 


Val 


336 


preS2 


A AT 


Asn 


A AC 


Asn 


348 


preS2 


TTG 


Leu 


TTC 


Phe 


388 


preS2 


CTT 


Leu 


TTT 


Arg 


488 


preS2 


ATC 


Ue 


A AC 


Asn 


887 


S 


A AG 


Lys 


ATG 


Met 


954 


S 


TC C 


Ser 


TCT 


Ser 



Ono, T. et al.. Nucleic Acids Research 11:1747-1757 (1983). 



35 
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6.1.2. CORE ANTIGENS 
Hepatitis B virus core antigen-encoding 
sequences were also siibcloned and engineered so as to 
be transcriptionally controlled by a vaccinia or 
5 vaccinia-like promoter. Like the surface antigen 
constructs, the hepatitis B core antigen constructs 
were designed so as to maintain a close proximity 
between the vaccinia promoter and the translational 
initiation codon of the core open reading frame. A 
10 gene encoding a full-length core was used. In 

addition, a deleted version of the core gene, referred 
to as coreAS, in which 8 amino acids are deleted as 
set forth below starting 16 amino acids from the 3 • 
end of the full-length core antigen, was used. 
15 CoreA8 ; 

pAM6 was digested with Styl at two sites, 
one 16 bp upstream of the initiating ATG of core and 
the second 5 bp downstream of the C-terminal end of 
the core antigen coding region, and the overhanging 

20 single-stranded ends were blunted by treatment with 
Klenow fragment of DNA polymerase I by standard 
protocols. This 584 bp fragment was then gel-purified 
and subcloned into Hindi-linearized pT7T318, 
generating a new plasm id, named pRO-09B (Fig. 25) , 

25 containing three Bglll sites. The orientation of the 
insert in pRO-09B was determined to be such that the 
EcoRI site in the polylinker of pT7T318 is located at 
the C-terminus of the open reading frame; digestion 
with Bglll and EcoRI yielded a restriction map 

30 consistent with fragments of 24, 68, 420, and 2954 bp. 

The 24 bp coreAS deletion was generated 
using a partial Bglll digest- By adjusting reaction 
conditions, it was possible to limit the action of 
Bglll such that some of the product of the reaction 

35 was cut only at one of the three Bglll sites in core. 
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This linearized plasmid, which consisted of a mixture 
of the pRO-09B plasmid which had been linearized at 
BgllX site #1, Bglll site #2, and Bglll site #3, was 
then gel- purified. This linear product was then 
5 digested to completion with Smal at the 3 1 end of the 
polylinker, and the large 3390 bp and 3414 bp vector 
fragments corresponding to initial linearization at 
either Bglll #2 or Bglll #3 were gel-purified. The 
158 bp Bglll/HincII fragment of pAM6 containing the 

10 C-terminus of the core ORF was then subcloned into the 
purified vector mixture. The resultant minipreps were 
screened for the presence of a 420 bp band upon Bglll 
digestion, indicative of insertion in the correct 
orientation in which the Bglll site has been 

IS recreated. The core regions of a number of candidate 
minipreps were sequenced to identify a clone in which 
Bglll #2 was the site of digestion in the initial 
partial digest. The resulting plasmid, called pT7T3/C 
(Fig. 26) , contained the expected 24 bp Bglll-Bglll 

2 0 deletion from bases 508 to 531 of the core ORF, and 

also had an unexpected single base pair deletion of an 
adenine residue at base 237 of 555 total base pairs 
within the full-length core open reading frame in 
comparison with the published sequence (Ono et al., 

25 supra) . Since this deletion would result in a 

frame-shift upon translation of the core RNA, a method 
of reinserting this base into the open reading frame 
was used, as described below. Besides our intentional 
24 bp deletion and this unintentional deletion at base 

30 237, we found 11 additional nucleotides which differed 
from the published wild-type core sequence, which are 
summarized in Table 5. Only one of these nucleotide 
changes results in a change in -amino acid sequence. 
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TABLE 5 

Comparison of Published HBV Core, (subtype adw)* 
and Core in pCODM/24 



5 



10 



15 



Base of Change 
Relative to A of ATO 


Published 
Triplet 


Published 
Aminp Add 


pCODM/24 
triplet 


pCODM/24 
Amino Acid 


81 


GTA 


Val 


GTC 


Val 


82 


CG A 


Arg 


AG A 


Arg 




C G A 


Arg 


C G G 


Arg 


156 


CAT 


His 


C AC 


His 


159 


ACT 


Thr 


ACC 


Thr 


229, 231 


C A A 


Gin 


GAG 


Glu 


243 


TC C 


Ser 


TC A 


Ser 


246 


A G A 


Arg 


AG G 


Arg 


288 


A A G 


Lys 


AAA 


Lys 


291 


ATC 


He 


ATT 


Me 


360 


GTC 


Val 


GTA 


Val 



* Ono, T. et at. Nucleic Acids Research 11:1747-1757 (1983). 



20 To reinsert the missing adenine at base 237 

of the core ORF, pT7T3/C was subjected to 
oligonucleotide-directed site-specific mutagenesis on 
double-stranded DNA by published techniques (see for 
example Inouye, s. and Inouye, M. , 

25 "Oligonucleotide-directed Site-specific Mutagenesis 
Using Double-Stranded Plasmid DNA M , in Synthesis and 
Applications of DNA and UNA, ed. S. Narang, Academic 
Press) . (Polymerase chain reaction could also be used 
to reinsert the missing nucleotide.) An. 

30 oligonucleotide of sequence 5 1 GGA GGA TCC aGC ATC AAG 
G 3 1 (SEQ ID NO: 11) was used both to replace the 
deleted base pair (lower case H a w ) and to generate for 
cloning purposes, without changing the core protein's 
amino acid sequence, a BamHI site (underlined) several 

35 bases 5 9 of the missing base pair. Mutation using 
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this procedure generated two types of DNA plasmids: 
wild-type and mutant containing the new BamHI site and 
the inserted adenine. The mutant DNA was then cloned 
selectively using this BamHI site, such that each of 
5 the two pieces of DNA on either side of the BamHI site 
were first cloned individually and then reunited. To 
do this, the transformed E~ coll containing a mixture 
of mutated and non-mutated plasmids were inoculated 
into 250 ml LB broth with 50 vg/ml ampicillin, and a 
10 purified large-scale preparation of the DNA was made 
by standard techniques (Sambrook et al., supra). 
Separate aliquots of this DNA were then digested with 
either EcoRI / BamHI or Hindlll/BamHI. Part of the 
EcoRI /BamHI-digested aliquot was subjected to 
15 electrophoresis on an agarose gel, and a 448 bp 

fragment was excised and purified. The same method 
was used to purify the 269 bp Hindlll/BamHI fragment. 
Use of this method generated two fragments which were 
then used to make two plasmids: pT7T3/CHB (Fig. 27), 
20 in which the 269 bp 5* Hindlll to BamHI region is 
cloned into the 2853 bp vector fragment of 
Hindlll/BamHI digested pT7T318, and pT7T3/CRB (Fig. 
28), in which the 448 bp 3* BamHI to EcoRI region is 
cloned into the 2862 bp vector fragment of 
25 BamHI/EcoRI -digested pT7T318. In both cases, the 

inserts were verified by restriction digestion to be 
oriented so as to recreate both restriction sites. 

To reunite the core open reading frame, the 
448 bp BamHI / EcoRI fragment of pT7T3/CRB containing 
30 the 3 • end of the mutated open reading frame was then 
cloned into the 3101 bp fragment of BamHI / EcoRI— 
digested pT7T3/CHB so as to preserve both the BamHI 
and EcoRI sites, generating pT7T3/CODM (Fig. 29) , 
which contains a complete coreA8 gene containing the 
35 desired single base pair insertion. The nucleotide 
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sequence (SEQ ID NO: 12) and deduced amino acid 
sequence (SEQ ID NO: 13) of the coreA8 ORF in 
pT7T3/CODM are shown in Figure 30. 
Full-length core ; 
5 in order to construct: a full-length core 

gene, pT7T3/CRB was digested with Bglll, which 
linearizes the plasmid, and ligated to a pair of 
oligonucleotides designed to hybridize to each other, 
add 24 bp of sequence, and leave overhanging 
10 Bglll-compatible ends. Oligonucleotides of sequence 
5» GAT CCC AAT CGC GGC GTC GCA GAC 3» (SEQ ID NO: 16) 
and 5 1 GAT CGT CTG CGA CGC GGC GAT TGG 3 1 (SEQ ID 
NO: 17), were used for this purpose, and were designed 
to make changes in the nucleotide sequence at each end 
15 of the insert so as to destroy both Bglll sites but to 
still encode the correct amino acid sequence. By 
digesting the religated plasmid with Bglll, selection 
was imposed for an insertion mutant, pCRB/24 (Fig. 
31), which lacks Bglll sites. pCRB/24 is similar to 
20 pT7T3/CRB except that pCRB/24 encodes 8 additional 
amino acids of core antigen sequence and lacks any 
Bglll sites. The correct orientation of the insert to 
recreate the authentic core amino acid sequence, and 
the fact that only a single insert had been made, was 
25 confirmed by dideoxy sequencing of pCRB/24. To 

reconstruct the full-length core gene, pCRB/24 was 
digested with EcoRI and BamHI, and the 472 bp fragment 
containing the 3 ■ end of the core gene was subcloned 
into the 31D1 bp vector fragment of 
30 EcoRI/BamHI-digested pT7T3/CHB to generate plasmid 
pCODM/24 (Fig. 32; Table 5, supra). The orientation 
of the insert in pCODM/24 was such that both the BamHI 
and EcoRI sites were recreated/ pCODM/24 is identical 
to pT7T3 /CO DM except that pCODM/24 contains 24 bp 
35 which are deleted in pT7T3/CODM. The nucleotide 
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sequence (SEQ ID NO: 14) and deduced amino acid 
sequence (SEQ ID NO: 15) of the core region in pCODM/24 
are shown in Figure 33. 

6.2. SINGLE EXPRESSION VECTORS FOR 
S. MS, LS AND CORE ANTIGENS 

A variety of expression vectors for 

insertion into the vaccinia thymidine kinase gene was 

constructed in which p7.5 (Fig- 2), pll (Fig. 4), or 

the modified p7.5 promoter (Fig. 6) directed the 

expression of one of the cloned HBV genes. A number 

of constructs were made in which expression of a 

single HBV antigen was directed by the p7.5 promoter. 

To facilitate this, a unique Xhol restriction site was 

created downstream of the p7.5 promoter in pGS53. 

This was done by linearizing pGS53 with BamHI, 

blunting the ends using Klenow treatment, setting up a 

ligation reaction in the presence of an excess of an 

Xhol linker of sequence 5* CCC TCG AGG G 3" (SEQ ID 

NO: 18) (New England Biolabs, cat. no. 1072), heating 

to denature, and religating. A fraction of this 

reaction was used to transform E. coll (DH5oc F'I Q ), 

and minipreps of colonies were screened for the 

presence of a new Xhol site. The plasmid thus 

created, pGS53X (Fig. 34), contains two BamHI sites, 

one on each side of the Xhol site. 

The 824 bp S fragment -was then excised from 

pT7T3/S by digestion with Hindlll and Kpnl followed by 

treatment with T4 DNA polymerase in the presence of 

30 deoxyribonucleotides. This treatment served to fill 

in overhanging 5* ends and remove overhanging 3' ends. 

The 1251 bp MS fragment was excised from pT7T3/S2 by 

digestion with Kpnl and SphI followed again by T4 

polymerase treatment. The 1325 bp LS fragment was 

excised from pLEH-04 by digestion with Hindlll and 

EcoRI followed by filling in the overhanging 5» ends 



20 



25 



35 



17:57:02 




WO 94/12617 



PCT/US93/11474 



- 69 - 



with the Klenow fragment of DNA polymerase I. The 
ends of all three gel-purified fragments thus 
generated were made Xhol -compatible by addition of the 
Xhol linkers described above, which in this case had 
5 been phosphorylated using T4 polynucleotide kinase and 
standard procedures, followed by digestion with. Xhol. 
Each fragment was then subcloned into the 4704 bp 
Xhol-linearized and BAP-dephosphorylated pGS53X. The 
single antigen expression vectors thus- created 
10 contained the S gene, pGS53/S (Fig. 35), the MS gene, 
PGS53/S2 (Fig. 36), or the 1^ gene, pGS53/Sl (Fig. 
37), each under the control of the p7.5 promoter. The 
orientation of each insert was determined using 
conventional restriction mapping to be such that each 
15 open reading frame could be expressed from the p7.5 
promoter. Specifically, the correct orientation of 
the insert was confirmed by Hindi digestion to yield 
restriction maps consistent with fragments from 
PGS53/S of 352 and 5186 bp, from pGS53/S2 of 514 and 
20 5447 bp, and from pGS53/Sl of 321, 520 and 5202 bp. 
The genes in these plasmids were engineered so that 
the 3 • end of the p7 . 5 promoter in these vectors is 
within approximately 20 bp of the initiating ATG. In 
each of these constructs, the expression cassette 
25 region is flanked by sequence from the vaccinia virus 
thymidine kinase gene. This thymidine kinase gene is 
contained in the Hindlll-J fragment of the vaccinia 
genome . 



30 expression of S, MS, and 1^ antigens individually 
directed by the modified p7.5 promoter and designed 
for insertion into the vaccinia TK gene was made as 
follows. The s region from pT7T3/s and the LS region 
from pLEH-04 were amplified by polymerase chain 

35 reaction using techniques well known in the art 



A similar series of constructs for 
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(Erlich, H.A., ed., PCR Technology, Principles and 
Applications for DNA Amplification, M. Stockton Press, 
New York, 1989). One primer used was of sequence 5* 
CTC ACT CGA GGG AAT AAG CT 3* (SEQ ID NO: 19), which 
5 hybridizes from bases -23 to -4 relative to the 

initiating ATG in both plasmids. This primer contains 
a 2 bp change from the plasmid sequence, AA to CG at 
positions 7 and 8, which was designed to create an 
Xhol site (CTCGAG) upon amplification. Another primer. 

10 used, 5' TTG TTA GGC CTT_JVAA TGT AT 3' (SEQ ID NO: 20), 
hybridizes at the carboxy- terminal end of the open 
reading frames in both of these plasmids (underlined 
nucleotides indicate position of termination codon TAA 
on the complementary strand) . This primer contains a 

15 2 bp change from the plasmid sequence, GT to CC at 
positions 9 and 10, designed to create a StuI site 
(AGGCCT) upon amplification. The fragments generated 
by PCR of pT7T3/S and pLEH-04 with these primers are 
expected to be of sizes 714 and 1203 base pairs, 

2 0 respectively. These pieces were then digested with 
Xhol and StuI to remove the small fragments at either 
end, yielding fragments of size 699 and 1188 base 
pairs, respectively, which were purified from an 
agarose gel. These latter fragments were inserted 

25 into the 4 026 bp vector fragment of Xhol and StuI 
double digested pSC59 so as to reconstruct Xhol and 
StuI sites at each end of the insert. The 
S-containing construct named pHTL-8 (Fig. 38) , and the 
LS containing construct, pHTL-9 (Fig. 39), each 

30 consist of the modified p7.5 promoter situated so as 
to direct expression of the HBV antigen located 
downstream. 

A construct designed to allow expression of 
the MS antigen from the modified p7.5 promoter was 
35 made as follows. A 998 bp EcoRI-StuI fragment of 
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PT7T3/S2 was gel-purified and inserted into the 4032 
bp EcoRI-StuI digested vector fragment of pSC59 so as 
to reconstruct the EcbRI and StuI sites at either end 
of the insert. The resulting expression plasmid, 
5 pHTL-10 (Fig. 40) , contains the MS ORF located 

downstream of the modified p7.5 promoter. pHTL-10 
contains the sequence "CC GATG " surrounding the ATG of 
preS2. According to M. Kozak (1989, J. Cell Biol. 
108:229-241) , the DNA sequence leading to the 
10 strongest ribosome binding to a translation initiation 
codon is "AN N ATG " . where N is any nucleotide. In the 
absence of a strong consensus-like sequence 
surrounding the ATG of preS2, significant levels of 
expression of the S antigen from the MS-containing 
15 constructs may be observed by western blot. 

Polymerase chain reaction may be used to modify the 
surroundings of the preS2 ATG in pHTL-10 or other 
constructs to match this consensus sequence, as 
follows, with the goal of shifting expression from S 
20 to MS. 

A fragment of pHTL-10 containing the 
initiating ATG of preS2 was amplified by PCR, and by 
the proper choice of primers the sequence upstream of 
the ATG was made to match the consensus sequence 

25 described by Kozak. One primer used was of the 

sequence 5* GAG CTC GGT ACC ACC ATG AAG TGG A 3 • (SEQ 
ID NO: 21), which hybridizes from bases -15 to +10 
relative to the preS2 ATG and contains a Kpnl site 
(GGTACC) as well as the desired C to A change at 

30 position 13. A second primer used in conjunction 

with this first primer is of sequence 5 f GAG CAG GGG 
TCC TAG GAA TC 3' (SEQ ID NO: 22) . This second primer 
hybridizes to the opposite strand from the first 
primer, from bases +204 to +185 relative to the preS2 

35 ATG in pHTL-10, and contains an Avrll site (CCTAGG) . 



17:57:02 



WO 94/12617 PCT/US93/11474 

- 72 - 



The product of a PCR reaction using these primers was 
digested with Kpnl and Avrll, run on an agarose gel, 
and the 197 bp fragment containing the initiating ATG 
of MS was gel-purified. This fragment was cloned into 
5 the 4833 bp gel-purified vector fragment of pHTL-10 
generated by digestion with Kpnl and Avrll so as to 
retain both restriction sites. Dideoxy sequencing was 
used to confirm that the expected changes had been 
made upstream of the initiating ATG of preS2 in the 

10 new plasmid, pHTL-lOM (Fig. 41), to match the 

consensus sequence. Western blotting was used to 
compare the relative expression levels of MS vs. S in 
pHTL-10 and pHTL-lOM (see Section 8.2 infra). 
Analysis by Western blot probed with an anti-S 

15 polyclonal antibody of proteins expressed transiently 
from pHTL-lo and pHTL-lOM in vaccinia- infected cells 
showed that the amount of MS protein expressed from 
pHTI*-10M was significantly increased and the amount of 
S protein significantly decreased relative to that 

2 0 expressed from pHTL-10. 

6-3. SINGLE EXPRESSION VECTORS FOR 
CORE-S-REGION FUSIONS 

A series of expression constructs consisting 

25 of fusions of the core antigen to the preS2 region or 

to part of the S or preSl regions was also made. The 

fusion polypeptides- were expressed from the modified 

p7.5 promoter (Fig. 6). The first step in generating 

core to S-region fusions was to subclone the full- 

30 length core gene downstream of this promoter. This 

was done by digesting pCODM/24 with PstI, treating 

with mung bean nuclease (New England Biolabs) to 

remove single-stranded overhangs according to the 

manufacturer's directions, and then digesting with 

35 EcoRI. The 727 bp fragment thus generated containing 

the wild-type core ORF was isolated and subcloned into 
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the 403 0 bp vector backbone of pSC59 which had been 
digested with Xhol, treated with mung bean nuclease, 
and then digested with EcoRI. • Restriction digestion 
was used to verify that the insert was oriented so as 
to retain the EcoRI site at the C-terminal end of the 
core antigen. The resulting plasmid, pHTL-25 (Fig. 
42), contains the full-length core antigen oriented so 
as to permit expression of the core from the modified 
p7.5 promoter • The expression cassette of promoter 
and antigen in pHTL-25 is flanked by pieces of the 
vaccinia TK gene to allow in vivo recombination of the 
plasmid into the vaccinia TK gene. 

A vector was constructed to allow expression 
of a core-preSl fusion polypeptide (called 
core-preSl* , because only a portion of preSl is 
present). This core-preSl* fusion consists, starting 
at the amino terminus of the fusion protein, of amino 
acid residues 1 through 145 of core, a 3 amino acid 
spacer (ser-ala-cys) , amino acids 1 through 56 of 
preSl, and a 4 amino acid tail (arg-pro-thr-ser ; 
SEQ ID NO: 56). To make this fusion vector, a piece of 
the preSl region corresponding to the first 56 amino 
acids of preSl was amplified by polymerase chain 
reaction from pLEH-04 . We expected this region of 
preSl to be immunogenic; data presented by Gassin et 
al. showed that the anti-preSl immune responses of 
health workers vaccinated with a plasma-derived HBV 
vaccine were primarily against amino acid residues 
12-27 and 32-53 (Gassin et al., 1990, Prevalence of 
preSl peptides antibodies following HEVAC B vaccine, 
in Progress in Hepatitis B Immunization , Coursaget, P. 
and M.J*. Tong, eds. , John Libbey Eurotext, London). 
One primer used was of sequence* 5 1 TCA CTA TCC GGA ATC 
AGC TT 3' (SEQ ID NO: 23), which hybridizes from bases 
-22 to -3 relative to the ATG of preSl and is designed 
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to introduce a BspEI site (TCCGGA) upon amplification 
and to destroy a potential termination codon 
(underlined C was changed from an A in pLEH-04) . A 
second primer used was of sequence 5 1 GGT GAA TTC TGG 
5 CCC GAA TG 3 f (SEQ ID NO: 24) , which hybridizes to the 
opposite strand from bases +171 to +152 of the preSl 
region and is designed to create an EcoRI site 
(GAATTC) upon amplification. The product of the PCR 
reaction was double digested with BspEI and EcoRI, and 
10 the 178 bp product was gel- purified and cloned into 
the 4483 bp BspEI and EcoRI double digested vector 
fragment of pHTL-25 in an orientation which recreated 
both the EcoRI and BspEI sites at the ends of the 
insert. The resulting plasmid, pHTL-26 (Fig. 43), 
15 consists of the modified p7.5 promoter oriented so as 
to drive expression of the core-preSl* fusion. The 
nucleotide sequence (SEQ ID NO: 25) and predicted amino 
acid sequence (SEQ ID NO: 26) of the core-preSl* fusion 
made from pHTL-26 are shown in Figure 44 • 
20 Using a similar design to that of pHTL-26, a 

construct was made for expression of a fusion 
consisting, starting at the amino terminus of the 
fusion, of amino acid residues 1 through 144 of core, 
a one amino acid spacer (asp) , residues 1 through 55 
25 of preS2, residues 1 through 8 of S, and a 4 amino 

acid tail (arg-pro-thr-ser; SEQ ID NO: 56) . The preS2- 
and S-region insert in this construct was generated by 
polymerase chain reaction amplification of pLEH-04. 
One primer used was of sequence 5» CAT CCT CCG GAC ATG 
30 CAG TG 3' (SEQ ID NO: 27), hybridizes from base +313 of 
the preSl region to base +8 of preS2, and was designed 
to create a new BspEI site (TCCGGA) during 
amplification. A second primer which was used in 
conjunction with this first primer was of sequence 5' 
35 GTC CTA GGA ATT CTG ATG TG 3' (SEQ ID NO: 28), which 
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hybridizes to the opposite strand from the first from 
bases +31 to +12 of the S region, and creates a new 
EcoRl site (GAATTC) during amplification. The product 
of PGR of pLEH-04 using these primers was digested 
5 with EcoRI and BspEI, and the 190 bp fragment was gel- 
purified. This fragment was then cloned in a manner 
so as to regenerate EcoRI and BspEX sites at either 
end of the insert into the 4483 bp gel-purif ied vector 
fragment of pHTL-25 which, had been double digested 
10 with EcoRI and BspEI. The resulting plasmid, pHTL-27 
(Fig. 45) , is designed to allow expression of a core 
to preS2 fusion from the modified p7.5 promoter. The 
nucleotide sequence (SEQ ID NO: 29) and predicted amino 
acid sequence (SEQ ID NO: 30) of the core-preS2 fusion 
15 made from pHTL-27 are shown in Figure 46. 

A third construct of this type, consisting 
from the amino terminus of amino acid residues 1 
through 144 of core, a one amino acid spacer (asp) , 
residues 107 through 163 of the S antigen, and a 
20 7 amino acid tail (asn-ser-gly-leu-leu-val-lys; 

SEQ ID NO: 57) was made in a similar manner. This 
fusion polypeptide is referred to herein as core-S* 
because only a small portion of the S antigen, judged 
to be most important in terms of antigenicity, is 
25 present. Hydrophilic stretches of the S antigen 

expected to contain antigenic determinants are located 
between residues 110 and 156 of S (Bhatnagar et al., 
1982, Proc. Natl. Acad. Sci. USA 79:4400-4409). The 
nonapeptide sequence from 139 to 147 represents all or 
30 an essential part of the a determinant of HBsAg, the 
major epitope in S against which humans make 
antibodies after exposure to any subtype of the 
hepatitis B virus (id.). A 197- bp piece of the S 
region containing these immunogenic regions was 
35 generated by PCR amplification of pLEH-04 using 
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primers 5 1 GTA TGT TTC CGG ATT GTC CT 3» (SEQ ID 
NO:31), which hybridizes from bases +304 to +323 of 
the S region and was designed to generate a BspEI site 
(TCCGGA) upon amplification, and 5* TGA GGC CGA ATT 
5 CCA TAG GT 3« (SEQ ID NO: 32), which hybridizes to the 
opposite strand from bases +500 to +481 of S and was 
designed to create an EcoRI site (GAATTC) upon 
amplification. The product of PCR on pLEH-04 using 
these primers was double digested with EcoRI and 

10 BspEI , and the 172 bp product containing the S 

fragment was gel-purified. This fragment was then 
inserted, in a manner such that the restriction sites 
were regenerated at either end of the insert, into the 
4483 bp vector fragment of pHTL-25 which had also been 

15 doubly digested with EcoRI and BspEI • The resulting 
plasmid was called pHTI*-28 (Fig, 47). The nucleotide 
sequence (SEQ ID NO: 33) and predicted amino acid 
sequence (SEQ ID NO: 34) of the core-S* fusion made 
from pHTL-28 are shown in Figure 48. 

20 

6.4. DUAL EXPRESSION VECTORS 
A number of constructs were made consisting 
of two promoters, divergently oriented and subcloned 
so as to be located within the vaccinia TK gene. 

25 Various antigens were then subcloned downstream of one 
or both of the promoters, with the goal of allowing 
expression of two non-fused antigens from a single 
vaccinia recombinant containing the integrated 
plasmid. The first step in one scheme for making 

3 0 these constructs was to delete the /?-galactosidase 
gene from pSCIO by digesting the plasmid with BamHI 
and religating the larger fragment. The resulting 
plasmid, pSClOAlacZ (Fig. 4 9) , contains the 11 kD 
promoter located upstream of a unique BamHI site. At 

35 this point, the p7.5 and pli promoters were placed 
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M back to back" "to allow for future construction of a 
single expression vector capable of expressing two 
different antigens. To do this, the 117 bp Xbal-BamHI 
fragment of pSClOAlacZ was excised , blunted by Klenow 
5 treatment, and inserted into the 4690 bp fragment of 
pGS53 which had been linearized by Hindi. The 
orientation of the insert in this new construct, pDPV 
(Fig. 50) , was determined to be such that the two 
promoters are oriented back-to-back; double- digestion 
10 with EcoRI and BamHI yielded a restriction map 

consistent with fragments of sizes 9, 35, 375 and 4392 
bp. This process retains the Xbal site between the 
two promoters but destroys the BamHI site downstream 
of pll. 

15 pDPV was modified to insert S downstream of 

the 11 kD promoter. This was done by cutting pDPV 
with Xhol and inserting the 834 bp S-containing Xhol 
fragment of pGS53/S into this 4811 bp linearized 
vector-. Digestion of candidate miniprep DNAs with 

20 Xbal to yield restriction maps consistent with 

fragments of sizes 218, 1387, and 404 O bp was used to 
verify the correct orientation of the inserted S piece 
downstream of the pll promoter in the resulting 
construct, pDPV-01 (Pig* 51). This construct contains 

25 an extra ATG 23 bp upstream of the S antigen 
initiating ATG which, if left in place, would 
interfere with proper expression of S from its own 
ATG. Oligonucleotide-directed mutagenesis was used to 
change this ATG to an ATA. One oligonucleotide used 

30 hybridizes to the strand containing the ATG from bases 
-12 to -3 0 relative to the initiating ATG of S, and 
has the sequence 5 • CGA GGA ATT TAT TTA TAG C 3 • (SEQ 
ID NO: 3 5) , where the underlined T indicates the base 
to be mutated. The sequence of the region upstream of 
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the S ATG in the resulting plasmid, pRO-10 (Fig. 52), 
was confirmed by dideoxy sequencing of the plasmid. 

A vector, pRO-16 (Fig. 53) , was constructed 
from pRO-lO, allowing expression of the S antigen from 
5 the pli promoter and the coreAS antigen from the p7.5 
promoter ♦ This was done by cutting pT7T3/CODM with 
Hindlll and EcoRI, filling in the overhanging 5' ends 
using the Klenow fragment of DNA polymerase I, and 
gel-purifying the 721 ,bp fragment containing the 
10 coreA8 antigen. This fragment was then subcloned into 
the 5645 bp vector fragment of Smal-linearized pRO-10. 
The orientation of the insert in the resulting 
plasmid, pRO-16, was determined to be such that the 
two promoters were oriented back-to-back; BamHI 
15 digestion yielded a restriction map consistent with 
fragments of 275 and 6091 bp. 

pRO-10 was modified to insert the MS antigen 
downstream of p7.5 as follows. The 1257 bp MS 
fragment was removed from pT7T3/S2 by double digestion 
2 0 with Kpnl and Hindlll, blunting the ends of the 

fragment with T4 DNA polymerase, and gel-purifying the 
1.2 kb fragment. This piece was then inserted into 
Smal-linearized, bacterial alkaline phosphatase 
(BAP) -treated pRO-10. The orientation of the insert 
25 in the resulting plasmid, pRO-ll (Fig. 54) , was 

verified to be such that MS could be expressed from 
the p7.5 promoter; digestion with Xbal yielded 
fragments of sizes 218, 538, 1387 and 4759 bp. pRO-ll 
contains the S and MS ORFs downstream of pll and p7.5, 
30 respectively. 

The S fragment of pRO-ll was also replaced 
by coreA8. This was done by cutting pRO-ll with Xhol , 
blunting the ends of the pieces by Klenow treating, 
BAP-treating, and gel-purifying the vector fragment of 
35 6072 bp. Into this blunt-ended vector was inserted 
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*the coreA8 fragment: of pT7T3/CODM, generated by 
cutting pT7T3/CODM with Hindlll and EcoRI, blunting 
the ends with Klenow, and gel-purifying the 721 bp 
core-containing region. Orientation of the insert in 
5 the resulting plasmid, pRO-17 (Pig* 55), was 

determined to be such that coreAe could be expressed 
from the pll promoter; digestion with BamHX yielded a 
restriction map consistent with fragments of sizes 49, 
654 and 6090 bp. pRO-17 contains coreAS downstream of 
10 the 11 kD promoter and MS downstream of the 7.5 kD 
promoter. 

A plasmid analogous to pRO-17 but containing 
the full-length core gene instead of coreA8 was made 
in a manner identical to pRO-17, except that the 745 

15 bp insert was cut from pCODM/24 instead of pT7T3/CODM. 
The same vector and restriction sites were used. The 
orientation of the insert in the resulting plasmid, 
pRO-22 (Fig. 56) was determined to be such that full- 
length core should be expressed from the 11 kD 

20 promoter; BamHI digestion yielded a restriction map 
consistent with fragments of sizes 49, 654 and 6114 
bp. 

A vector was also created containing only 
the divergently oriented p7.5 and pll promoters by 

25 cutting pRO-10 with XhoX to excise the 834 bp 

S-containing fragment and religating the large 4811 bp 
vector fragment to delete the S region. The resulting 
plasmid, pRO-12 (Fig. 57), contains only the p7.5 and 
pll promoters and no HBV antigens. This plasmid 

30 differs from pDPV in that it lacks the extra ATG 

downstream of the pll promoter which was removed in 
the course of making pRO-10 from pDPV-01, 

From pRO-12, a vector was created consisting 
of the LS antigen inserted downstream of p7.5. This 

35 construct was made by cutting the I>S antigen from 
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pLEH-04 using a Hindlll and EcoRI double digest, 
filling in the overhanging 5' ends by Klenow 
treatment, and gel-purifying the 1329 bp fragment. 
This LS fragment was then subcloned into the 4811 bp 
5 vector fragment of Sraal-digested, BAP-treated pRO-12. 
The orientation of .the insert in the resulting 
plasmid, pRO-13B (Fig, 58), was determined to be such 
that LS should be expressible from p7.5; digestion 
with Xbal and PstI yielded a restriction map 
10 consistent with fragments of 125, 224, 641, 646, and- 
4 504 bp. pRO-13B therefore consists of the pll and 
p7 . 5 promoters oriented back-to-back with the LS open 
reading frame positioned downstream of p7.5. 



15 of pll in pRO-13B. This was done by digesting 

PT7T3/CODM with Hindlll and EcoRI, filling in the 
overhanging 5 1 ends with the Klenow fragment, and gel- 
purifying the 721 bp coreA8-containing fragment. This 
fragment was then subcloned into the 6144 bp XhoX- 

2 0 lineari2ed vector pRO-13B which had been Klenow- and 
then BAP-treated. The orientation of the insert in 
the resulting plasmid, pRO-18 (Fig. 59) , was 
determined to be such that the coreAS open reading 
frame should be expressed from the 11 kD promoter; 

25 digestion with BamHI yielded a restriction map 

consistent with fragments of sizes 376, 654 and 5835 
bp. 



was made by cutting pRO-17 with Xbal to remove the 
30 1496 bp region containing the pll promoter, core, and 
most of one-half of the vaccinia DNA, and cloning this 
piece into the 4040 bp Xbal-cut, BAP-treated vector 
fragment of pRO-12. Orientation of the insert in the 
resulting plasmid, pRO-19 (Fig. 60) , was determined to 
35 be such that the two promoters, p7 . 5 alone and pll 



The coreAB antigen was inserted downstream 



A vector featuring coreA8 downstream of pll 
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upstream of coreA8, were divergently oriented; 
digestion with BamHI yielded a restriction map 
consistent with fragments of sizes 654 and 4882 bp. 

A plasmid analogous to pRO-19 containing the 
5 pll-full-length core antigen cassette was made by 

cutting pCODM/24 with Hindlll and EcoRI, blunting the 
ends by Klenow treatment , and gel-purifying the 745 bp 
fragment. This core-containing fragment was subcloned 
into the 4811 bp Xhol- linear! zed pRO-12 which had also 

10 been made blunt-ended by Klenow. The resulting 

plasmid features full-length core downstream of pll. 
Orientation of the insert in the resulting plasmid, 
pRO-19/24 (Fig. 61) , was determined to be such that 
the full-length core gene could be expressed from the 

15 11 kD promoter; digestion with BamHI yielded a 

restriction map consistent with fragments of sizes 654 
and 4906 bp. 

To allow production of a series of 
constructs featuring the modified p7.5 promoter and 

20 the pll promoter back to back, another vector was 
made. pRO-19/24 was double digested with Xbal and 
EcoRI to excise the 893 bp fragment containing the 
pll-full-length core expression cassette. This 
fragment was gel-purified, the 5* overhanging ends 

25 filled in by Klenow fragment of DNA polymerase I, and 
the piece subcloned into the 4042 bp vector fragment 
of pSC59 which had been digested with Hindlll and 
treated with Klenow and BAP. To determine that the 
insertion had been made such that the two promoters, 

30 pll and the modified p7.5, were divergently oriented, 
the resulting plasmid, pHTI*-5 (Pig. 62) , was digested 
with PvuII and Hindlll to yield a restriction pattern 
consistent with fragments of 1735 and 3200 bp- pHTL-5 
contains the full-length core gene expressed from the 
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pll promoter, and the modified p7»5 promoter without 
any antigen cloned downstream. 



from the modified p7.5 promoter and' full-length core 
5 from the 11 kD promoter were made as follows. The S 
region from pT7T3/S and the LS region from pLEH-04 
were removed by polymerase chain reaction using the 
same pair of primers. One primer used was of sequence 
5» CTC ACT CGA GGG AAT AAG CT 3» (SEQ ID NO: 36), and 

10 was designed to hybridize from bases -23 to -4 

relative to the initiating ATG in each plasmid and to 
generate an Xhol site (CTCGAG) upon amplification. A 
second primer used in conjunction with the first 
primer was of sequence 5 1 TTG TTA GGC CTT AAA TGT AT 

15 3 f (SEQ ID NO: 37), and was designed to hybridize to 
the opposite strand from the first primer from bases 
+1179 to +1160 relative to the ATG of LS in pLEH-04 
and from bases +690 to +671 relative to the ATG of S 
in pT7T3/S, and to generate a StuI site (AGGCCT) upon 

20 amplification. The fragments generated by PCR using 
these primers on pT7T3/S and pLEH-04 were of sizes 714 
and 1203 base pairs, respectively. These amplified 
fragments were digested with Xhol and StuI, removing a 
few base pairs at either end and yielding fragments of 

25 size 699 and 1188 base pairs, respectively, which were 
purified from an agarose gel. These fragments were 
cloned so as to regenerate Xhol and StuI sites at 
either end of the insert, into the 4947 bp vector 
fragment of pHTL-5 which had been digested with Xhol 

30 and StuI. The S- and LS-containing plasmids generated 
in this manner were called pHTL-6 (Fig. 63) and pHTL-7 
(Fig. 64) , respectively. 



expressed from the modified p7.5 promoter and the 
35 full-length core antigen expressed from the 11 kD 



From pHTL-5, plasmids expressing S or LS 



A similar construct consisting of MS 
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promoter were made as follows. The 998 bp 
MS-containing fragment: of EcoRI-stuI digested pT7T3/S2 

- was gel-purif ied and subcloned into the 4929 bp 

EcoRI-stuI vector fragment of pHTI*-5. Restriction 

t 5 mapping was used to verify that the insertion had been 

made in an orientation so as to regenerate the EcoRI 
site at the 5 • end encoding the amino terminus of the 
MS insert and the StuI site at the 3 • end encoding the 
car boxy 1 terminus in the resulting .plasmid, pHTL-ll - 
10 (Fig. 65) . 

A plasmid for expressing MS from the 
modified p7.5 promoter and coreA8 from the pll 
promoter was made from pHTJL-11. To do this, a 849 bp 
Avrll-BspEI fragment of pHTI*-ll containing the two 

15 promoters and the $' end of each open reading frame 
was gel-purified and cloned into the 5742 bp 
Avrll-BspEI vector fragment of pRO-17 containing the 
3 9 ends of MS and coreA8 . The insert was made in such 
a way as to regenerate the Avrll and BspEI sites, and 

20 the resulting plasmid was called pHTL-12 (Fig. 66) . 
In this plasmid, the coreA8 antigen has replaced the 
wild-type core antigen which was present in pHTI*-ll. 

From pHTL-12 was made another construct, 
PRO-16M (Fig. 67), which consists of the coreA8 

25 antigen cloned downstream of the p7.5 promoter. This 
construct was planned so as to delete an out-of -frame 
ATG which is present 31 bp upstream of the correct 
initiating ATG of coreAS in pRO-16. pRO-l6M was made 
by linearizing pRO-12 with BamHI, which cuts 

30 downstream of the p7*5 promoter, blunting the ends 
with Klenow treatment, and then treating with BAP. 
The 719 bp cor eA8 -containing insert was gel-purified 
from pHTL-12 which had been digested with PstI and 
then treated with mung bean nuclease. From the 

35 ligation of this fragment of pHTL-12 into the 
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linearized pRO-12 were chosen clones containing the 
coreA8 antigen situated downstream of p7.5. The 
presence and correct orientation of the insert in 
pRO-16M were determined by Pstl/BspEI digestion, which 
5 yielded a restriction pattern consistent with 
fragments of 850 and 4684 bp. 

From pRO-16M was made an additional 
construct, pHTL-30 (Fig. 68) , consisting of MS 
expressed from the modified p7.5 promoter and coreA8 - 
10 expressed from the p7.5 promoter. The p7.5-coreA8 

cassette was excised from pRO-16M by PCR. One primer 
used for this reaction was of sequence 5' GTG GGT AAG 
CTT CTC GAT GT 3 1 (SEQ ID NO: 38) and is designed to 
hybridize from bases -230 to bases -211 relative to 
15 the ATG of coreAS, upstream of the p7.5 promoter, and 
to create a Hindlll site (AAGCTT) upon amplification. 
A second primer used in combination with the first 
primer was of sequence 5 • AGT TTC CAA GCT TAT GAG 3 1 
(SEQ ID NO: 39), and was designed to hybridize to the 
2.0 opposite strand from bases +563 to +546 and to 
generate a Hindlll site upon amplification. The 
product of a PCR reaction using these two primers was 
digested with Hindlll and the 774 bp fragment 
containing p7.5-coreA8 was gel-purified'. This 
25 fragment was inserted into pHTL-lOM which had been 

linearized with Hindlll just upstream of the modified 
p7.5 promoter* and BAP treated. Hindlll digestion was 
used to verify the presence of the insert in pHTL-30. 
Digestion with EcoRI/BspEI yielded a restriction map 
30 consistent with fragments of 706 and 5098 bp, 
verifying that the two promoters were oriented 
back-to-back . 

Using pHTL-5, a variant of pRO-16 consisting 
of S expressed from the pll promoter and full-length 
3 5 core, instead of coreAS, expressed from the p7.5 
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promoter was made. This was done by cutting pHTL-5 
with BspEI and Sad to excise the 268 bp fragment 

: coding for the carboxyl terminus of full-length core. 

This fragment was gel-purified and inserted into the 

' 5 6123 bp vector fragment of pRO- 16 which had been 

digested with BspEI and SacI and gel-purified. The 
orientation of the insert in the resulting plasmid, 
pHTL-13 (Fig. 69), was determined , based on the 
recreation of the BspEI and Sac! sites, to be such 
10 that the two promoters are divergently oriented. 

From pHTL-13 was made a plasmid, pHTL-15 
(Fig. 70), which contains the S antigen cloned 
downstream of the modified p7.5 promoter and full- 
length core cloned downstream of the p7.5 promoter. 
15 This plasmid was made by digesting pHTL-13 with XbaX 
and SacI to excise the fragment containing the p7.5- 
full-length core cassette. This fragment was gel 
purified, treated with mung bean nuclease, and the 
1008 bp resulting fragment inserted into the 4729 bp 
20 vector fragment of pHTI*-8 which had been linearized 
with Bell, Klenow filled, and BAP-treated. Presence 
of and correct orientation of the insert in pHTI*-15 
was verified by digestion with PstI to yield a 
restriction map consistent with pieces of 2223 and 
25 3514 bp. This plasmid may be used for additional 

cloning; however, it is preferably not used for making 
recombinant vaccinia viruses because there is a 
segment of about 600 bp of vaccinia. DNA between the 
two expression cassettes which may interfere with the 
30 recombination of both cassettes into the vaccinia 
genome. However, by deletion of this sequence, a 
plasmid useful for making recombinant viruses can be 
made . 

From pHTL-13 was made another plasmid, 
35 pHTL-17 (Fig. 71), containing MS downstream of the 
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modified p7.5 promoter* and full-length core downstream 
of the p7.5 promoter. This plasmid was made by 

digesting pHTL-13 with Xbal and SacI to excise the s 
fragment containing the p7 . 5-full-length core 
5 cassette. This fragment was gel-purified, treated t 
with mung bean nuclease, and the 1008 bp resulting 
fragment inserted into, the 5034 bp vector fragment of 
pHTL-10 which had been linearized with Bell and Klenow 
filled. Presence of the insert in pHTL-17 was 
10 verified by digestion with PstI, which yielded a 

restriction map consistent with pieces of sizes 1000, ~ 
2223, and 2819 bp* Orientation of the insert was 
verified by this digest to be such that the two 
promoters are back- to-back . This plasmid may be used 
15 for additional cloning; however, it is preferably not 
used for making recombinant vaccinia viruses because 
there is a segment of about 600 bp of vaccinia DNA 
between the two expression cassettes which may 
interfere with the recombination of both cassettes 
2 0 into the vaccinia genome. However, by deletion of 

this sequence, a plasmid useful for making recombinant 
viruses can be made. 

From pHTL-15 and pHTIi-30 was made another 
plasmid, pHTL-33 (Fig. 72), consisting of full-length 
25 core cloned downstream of p7.5 and MS with the 

upregulated ATG cloned downstream of the modified p7.5 
promoter. This plasmid was made by polymerase chain 
reaction amplification of the p7 . 5-full-length core 
cassette out of pHTL-15 using primers of the sequence 
30 5' CTC GAT GTC GAC TAG CCA TA 3* (SEQ ID NO: 58), which 
hybridizes to pHTL-15 from positions -237 to -218 
relative to the ATG of core and the last four bases of 
which are part of an Ndel site .(CAT ATG) , and 5' 
AAG TTG TCG ACC TTA TGA GT 3' (SEQ ID NO: 59), which 
35 hybridizes to the opposite strand from positions +587 
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to +568, downstream of the termination codon for full- 
length core, and which contains a Sail site (GTCGAC) . 
The product of the PCR reaction was digested with Sail 
and Ndel and the 798 bp fragment containing the core 
5 expression cassette was isolated from an agarose gel 
and cloned into the 5046 bp gel purified vector 
fragment of pHTL-30 which had been digested with Sail 
and Ndel. Presence of the insert and situation of the 
two cassettes in a back -to-back orientation in pHTIi-33 

10 was determined by digestion with Sail and NdeX to 

yield a restriction map consistent with fragments of 
sizes 798 and 5046 bp. 

From pHTL-33 was made an additional plasmid, 
pHTL-34 (Fig. 73), consisting of S expressed from the 

15 modified p7.5 promoter and full-length core expressed 
from the p7.5 promoter. This plasmid was made by 
digesting pHTL-8 with Xhol and ClaX and gel purifying 
the 767 bp fragment containing the S antigen. This 
fragment was cloned into the gel purified 4772 bp 

20 vector fragment of pHTI*-33 which had also been 
digested with Xhol/Clal- Presence and correct 
orientation of the insert in pHTI»-34 were verified by 
digestion of the plasmid with Xhol/Clal to yield a 
restriction map consistent with fragments of 767 and 
25 4772 bp. 

Also from pHTX.-33 was made another plasmid, 
pHTL-35 (Fig. 74$ r consisting -of LS expressed from the 
modified p7.5 promoter and full-length core expressed 
from the p7.5 promoter. This plasmid was made by gel 

30 purifying the 1256 bp Xhol/Clal fragment of pHTL-9 
containing the LS antigen and inserting it into the 
4772 bp vector fragment of pHTL-3 3 which had also been 
digested with Xhol/Clal. Presence and correct 
orientation of the insert in pHTL-35 were verified by 

35 digestion of the plasmid with Xhol and Clal to yield a 
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restriction map consistent with fragments of 1256 and 
4772 bp. 

6.5- PRES1-CORE FUSION ^CONSTRUCTS 
5 A series of constructs consisting of t 

preSl-core fusions was also made* PreSl-core fusions 
included only the preSl region of the LS antigen 
(i.e., lacking the preS2 and S regions) fused to one 
of the two types of core (full length or coreAS) . 

10 pPB-05 (Fig. 76) contains the preSl-encoding region 

fused to the coreA8 gene, with preSl constituting the - 
amino terminus of the resulting fusion protein. 
pPB-09 (Fig. 77) contains the preSl-encoding region 
fused to the full-length core gene in the same 

15 orientation. Both of these constructs consist of 

inserts into the standard cloning vector, pT7T318, so 
that the open reading frames are flanked by 
restriction endonuclease sites and can be further 
manipulated into expression vectors as needed. 

2 0 To make pPB-05, the coreAS open reading 

frame was cloned by polymerase chain reaction 
amplification from pT7T3/CODM. The primers used to 
accomplish this were of sequence 5 1 GCG CCA TGG ACA 
TTG ACC CTT ATA 3« (SEQ ID NO: 40), which hybridizes 

2 5 from bases -5 to +19 relative to the initiating ATG of 

coreA8, and 5* CCC TGA TCA CTA ACA TTG AGA TTC CCG A 
3» (SEQ ID NO: 41) , which hybridizes to the opposite 
strand from bases +543 to +516 relative to the ATG. 
These primers were designed to generate a new Ncol 
30 site (CCATGG) at the 5* end and a new Bell site 

(TGATCA) at the 3 • end of the amplified open reading 
frame, respectively. The amplified reaction products 
were digested with Ncol and Bell, and then separated 
by electrophoresis on an agarose gel. A 53 6 base pair 

3 5 fragment corresponding to the desired piece of coreAS 
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was excised and electroeluted from the gel. This 
fragment was then cloned into the 4601 bp vector 
fragment of pET-3d (Fig. 78), a vector for bacterial 
expression of cloned inserts from the T7 promoter 
5 (Studier et al., 1990, Neth. Enzymol. 185:60-89). pET 
system vectors were obtained from Novagen, Inc. 
(Madison, WI) ; and cut with Ncol and BamHI (compatible 
with Bell) .* The resulting plasmid, pT7/core (Fig. 
79) , contains the coreAS ORF cloned downstream of the 
10 T7 promoter. Restriction mapping was used to 

determine that the insert in pT7/core had been made so 
as to retain the Ncol site at the 5 • end of the ORF 
encoding the amino terminus. 

Plasmid pT7/core was then transformed into a 
15 derivative of B. coli strain BL21 (F~ ampT rj,-^-) 
containing the DE3 lysogen. DE3 is a lambda 
derivative that carries in the Int gene a DNA fragment 
containing the lacl gene, the lacUVS promoter, the 
beginning of the lacZ gene, and the gene for T7 RNA 
20 polymerase. In a DE3 lysogen, the lacUVB promoter is 
inducible by isopropyl-jS-D-thiogalactopyranoside 
(IPTG) , such that addition of IPTG to a growing 
culture of this lysogen induces T7 RNA polymerase 
(Studier et al. , supra). The BL21(DE3) lysogen was 
25 transformed with pT7/core, and host cells carrying the 
recombinant pT7/core plasmid were grown in 1 liter of 
LB broth at 37 *c. CoreAS synthesis was induced by the 
addition of 0.4 mM IPTG, so that the coreAS gene was 
expressed from the T7 promoter in pT7/core. 
30 This induced culture was harvested, and the 

coreAS antigen, which is expressed intracellular ly, 
was partially purified. Induced bacterial cells were 
harvested by low speed centrif ligation and resuspended 
in 20 ml per liter of culture modified lysis and 
35 sonication (L&S-l) buffer (50 mM Tris pH 8.0, 10 mM 
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EDTA, 10 mM dithioerythritol (DTT) , 1 mM 
phenylmethylsulfonyl fluoride (PMSF) , 0.25 mg/ml 

lysozyme, 4 mg of deoxycholate (DOC) per gram of ; 
cells) . The cell suspension was sonicated three times 
5 (Braun-Sonic 2000, half-max. high, 10 seconds) and * 
then centrifuged at 12,000 x g for 40 min at 4°C. The 
supernatant was then treated with 30% ammonium 
sulfate, stirred for 1 hour at 4°C, and centrifuged at 
12,000 x g for 40 min. Ammonium sulfate was then 
X0 added to the supernatant to 40%. This mixture was 
stirred for 1 hour at 4*C and centrifuged at 
12,000 x g for 40 min. The precipitated pellet was 
then resuspended in 6 M urea, 50 mM Tris pH 8.0, 10 mM 
EDTA, 10 mM DTT, 1 mM PMSF buffer to a protein 
15 concentration of 3 mg/ml. This dissolved solution was 
loaded onto a sizing column (Bio-Rad A1.5, 2.5 x 100 
cm) , equilibrated and eluted with the same buffer. A 
protein profile of the eluted fractions was obtained 
by absorbance at 280 nm, or by a micro protein assay 
2 0 (Bio-Rad) . One major peak containing coreA8 peptide 
was observed. Fractions containing this peak were 
pooled and dialyzed against Tris/saline (10 mM Tris, 
0.15 M NaCl) , pH 7.5. 

The purification steps for the recombinant 
25 peptide, expected to be of molecular weight 20.3 kD, 
were followed by SDS-PAGE analysis on a 15% gel. 
Samples were mixed with SDS sample buffer to a- final 
concentration of 25 mM Tris-HCl pH 6.8, 4% glycerol, 
0.8% SDS, 2% 0-mercaptoethanol, 0.0005% bromophenol 
30 blue. CoreA8 purified in this manner was substituted 
for the Biodesign core antigen in the immunoassays 
described in example 8.3 below. 

The next step in the construction of pPB-05, 
the preSl-coreA8 fusion vector, was to transfer the 
35 coreA8 antigen back into a general cloning vector. 
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pT7/core was digested with Xbal and Hindlll to excise 
the 1056 bp coreAB-containing piece, and this piece 
was subcloned into the 2859 bp vector backbone of 
PT7T318 which had been digested with Xbal and Hindlll 
5 so as to reconstruct the Xbal and Hindlll sites. The 
resulting plasmid, pPB-04 (Fig. 80), contains the 
coreA8 antigen surrounded on either end by restriction 
sites. 

To make a fusion of the preSl region to the 
10 5» end of the coreA8 region in pPB-04, the preSl 

region was cloned by PCR out of pRO-02. The primers 
used for this purpose were of sequence S 9 ATT CTC GAG 
GGC ATG GGG ACG 3« {SEQ ID NO: 42), which hybridizes 
from bases -12 to +9 relative to the initiating ATG of 
15 preSl, and 5» ACT CCA TGG CCT GAG GAT GA 3' (SEQ ID 
NO: 43), which hybridizes to the opposite strand from 
bases +331 to -f312 relative to the ATG. These primers 
were designed to generate Xhol (CTCGAG) and Ncol 
(CCA TGG) sites at the amino and car boxy 1 termini, 
20 respectively, of the 343 bp amplified preSl ORF. The 
reaction mixture generated by PCR was digested by 
Xhol, the ends filled in using the Klenow fragment of 
DNA polymerase I, and the DNA cut by Ncol. The 
resulting 333 bp fragment containing the preSl region 
25 was gel-purified and cloned into the 3865 bp vector 
fragment of pPB-04 which had been cut at Smal in the 
polylinker region at the 5* end of coreAS and at Ncol 
at the initiating ATG of the coreAS antigen. 
Restriction mapping was used to confirm that the 
3 0 inserted DNA was oriented so as to recreate the Ncol 
site at the fusion point of the preSl and core 
regions. The resulting plasmid, pPB-05, contains a 
fused preSl-coreAS open reading, frame surrounded by 
restriction sites. The nucleotide sequence (SEQ ID 
35 NO: 44) and predicted amino acid sequence (SEQ ID 
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NO: 45) of the preSl-coreA8 fusion region in pPB-05 are 
shown in Figure 81. 



pRO-21 (Fig. 82), containing the MS antigen expressed 
5 from the p7.5 promoter and the preSl-coreA8 antigen 
expressed from the pll promoter. pPB-05 was digested 
with EcoRI and Hind I I I to excise the 

preSl-coreA8-containing fragment and this fragment was 
made blunt-ended by Klenow treatment. This 1370 bp 

10 insert containing the preSl-coreA8 fusion was gel- 
purified and inserted into the 6072 bp Xhol vector 
fragment of pRO-11 which had been Klenow-blunted and 
dephosphorylated by BAP treatment. The orientation of 
the insert was verified by BamHI digestion to give a 

15 restriction map consistent with fragments of 49, 966 
and 6427 base pairs, indicating that the preSl-core£8 
open reading frame in pRO-21 is oriented so as to be 
expressible from the pll promoter. 



2 0 preSl-full-length core fusion, was constructed by 
first amplifying the full-length core antigen from 
pCODM/24 by PCR. The primers used for this purpose 
were of sequence 5' GCG CCA TGG ACA TTG ACC CTT ATA 3* 
(SEQ ID NO:46), which hybridizes from bases -5 to +19 

25 relative to the initiating ATG of core, and 5 • AGT GAA 
GCT TCC CAC CTT 3» (SEQ ID NO: 47), which hybridizes to 
the- opposite strand from bases +592 to +575 relative 
to the full-length core ATG. These primers were 
designed to generate Ncol (CCATGG) and Hindlll 

30 (AAGCTT) sites at the N-terminal and C-terminal ends 
of core, respectively, upon amplification. The 
resulting reaction mixture was digested with Ncol and 
Hindlll. The 584 bp core-containing piece was then 
gel-purified and inserted into the 3181 bp gel- 

35 purified vector backbone of pPB-05 from which the 



From pPB-05 .was made an expression plasmid. 



pPB-09, the plasmid containing the 
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coreAS open reading frame had been removed by 
Ncol/Hindlll digestion. Restriction mapping was used 
to verify that the insert in the resulting plasmid, 
pPB-09, was oriented so as to reconstruct both the 
5 Ncol and Hindlll sites. This process had the effect 
of switching the full-length core antigen for coreAS ♦ 
The nucleotide sequence (SEQ ID NO: 48) and predicted 
amino acid sequence (SEQ ID NO: 49) of the preSl-full- 
length core fusion in pPB-09 are shown in Figure 83 . 
10 An analogous construct to pRO-21 in which 

the full-length core antigen was substituted for the 
coreAS antigen was made from pPB-09 and called pRO-23 
(Pig. 84). To make pRO-23, the preSl-f ull-length core 
insert was purified from pPB-09 by digesting pPB-09 
15 with EcoRI and Hindlll, and using Klenow fragment to 
blunt the ends. This 937 bp fragment was gel-purified 
and subcloned into the 6072 bp vector fragment of 
pRO-ll which had been digested with Xhol and the ends 
filled by Klenow. The correct orientation of the 
20 insert in the resulting plasmid, pRO-23, was verified 
to be such as to allow expression of the fusion from 
the pll promoter; digestion with BamHI yielded a 
restriction map consistent with fragments of 49, 966 
and 5994 base pairs. pRO-23, therefore, consists of 
25 MS downstream of the p7.5 promoter and the preSl-full- 
length core antigen fusion downstream of the pll 
promoter . 

From pPB-05, a plasmid was made for the 
expression of the preSl-coreA8 fusion from the 11 kD 

30 promoter* This was done by isolating the 1365 bp 

EcoRI-Hindlll fragment of pPB-05 containing the fusion 
ORF and blunting the ends with Klenow. This 137.0 bp 
fragment was inserted into the 4149 bp vector fragment 
of pHTL-5 which had been cut with Hindlll and BamHI 

35 and made blunt-ended by Klenow. The orientation of 
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the insert in the resulting plasmid, pHTI>-23 (Fig. 
85), was determined to be such that the preSl-coreA8 
should be expressed from the 11 kD promoter; NcoI/StuI 
digestion yielded a restriction map consistent with 
5 fragments of 532 and 4987 bp. 

A similar plasmid allowing expression of the 
preSl-full-length-core antigen from the pll promoter 
was constructed from pHTL-23. This was done by 
cutting pHTL-5 with Sad, treating with mung bean 

10 nuclease to blunt the overhanging ends, and then 

cutting with Bglll and isolating the 611 bp fragment ; 
of pHTL-5 containing the 3* end encoding the carboxyl 
terminus of the full-length core antigen. This insert 
was then ligated into the 4785 bp vector fragment of 

15 pHTL-23 which had been digested with Nhel, mung bean 
and BAP-treated, and then digested with Bglll to 
excise the 3» end encoding the carboxyl terminus of 
the coreA8 antigen- The orientation of the insert in 
the resulting plasmid, pHTL-24 (Fig. 86) , was verified 

20 to be such that the preSl-full-length-core antigen 
could be expressed from the pll promoter; digestion 
with Bglll and StuI yielded a restriction pattern 
consistent with fragments of 617 and 4779 bp . 

Another plasmid, pHTL-18 (Fig. 87), which 

25 was made from pRO-23, contains the MS ORF downstream 
of the modified p7 . 5 promoter and the preSl-full- 
length core fusion ORF downstream of the p7.5 
promoter. This plasmid was made by isolating a 763 bp 
fragment containing preSl fused to the 5 1 end of the 

3 0 full-length core gene generated by cutting pRO-23 with 
Xhol, Klenow filling and then digesting with BspEI. 
This gel— purified insert was then inserted into the 
5579 bp vector fragment of pHTL-17 made by cutting 
with SphI, blunting with mung bean nuclease, and then 

35 digesting with BspEI. Orientation of the insert was 
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determined by restriction mapping to be such that the 
BspEI site near the 3 • end of full-length core had 
been regenerated* This plasmid contains an insertion 
of vaccinia DNA between the two promoters and as such 
5 is difficult to use to make recombinant viruses, but 
it is useful for cloning purposes* The region between 
the two promoters can be deleted to make a plasmid 
which can be used to generate recombinant viruses. 

From pHTI,-l8 was made an additional plasmid , 
10 pHTL-14 (Fig. 88) , consisting of the MS antigen 
downstream of the modified p7.5 promoter, and the 
preSl-coreA8 fusion downstream of the p7.5 promoter* 
To make this plasmid, the 819 bp core antigen region 
was isolated from pRO-21 by digestion with Nhel, 
15 Klenow blunting and Ncol digestion. This fragment was 
then gel-purified and inserted into the 5290 bp vector 
fragment of pHTIi-18 which had been digested with Pvull 
and Ncol* The orientation of the insert in pHTL-14 
was verified by restriction mapping to be such that 
2 0 the Ncol site at the junction of the preSl and core 
ORFs had been regenerated. Like pHTL-17 and pHTL-18, 
pHTL-14 contains a region of vaccinia DNA between the 
two promoters. This region should be deleted, and a 
larger region of the vaccinia TK gene inserted 
25 downstream of the preSl-coreA8 ORF, in order to use 
the plasmid for generation of recombinant virus. 
However, even without these changes, pHTL-14 is useful 
for cloning purposes. 

From pHTL-14, was made an additional 
30 plasmid, pHTL-31 (Fig. 89), consisting of MS with the 
upregulated ATG as described for pHTL-lOM above, 
expressed from the modified p7.5 promoter, and the 
preSl-coreA8 fusion expressed from the p7.5 promoter. 
To make this plasmid, pHTL-14 was digested with Ndel 
35 and BspEI to isolate the region containing p7.5-preSl 
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and the 5" end of coreA8- This 949 bp fragment: was 
gel-purified and inserted into the 5174 bp vector 
fragment of pHTL-30 which had also been digested with 
NdeX and BspEX and gel-purified. The insertion in 
5 pHTL-3l was made so as to recreate both the NdeX and 
the BspEX sites. 

From pHTXi— 31 was made an additional vector, 
pHTL-32 (Fig. 90) , consisting of the preSl-coreAS 
antigen expressed from the p7.5 promoter and the 

10 modified p7.5 promoter with no antigen downstream. 
This plasmid was made by digesting pHTL-31 with Xhol 
and StuI, Klenow filling, gel-purifying the 5121 bp 
vector fragment from which the MS gene had been 
deleted, and self-ligating this plasmid to reclose the 

15 circle. 

From pHTL-33 and pHTIi-31 was also made a 
plasmid, pHTL-36 (Fig. 75) , which is an analog of 
pHTL-31 in which full-length core has been substituted 
for coreA8. The plasmid pHTL-36, therefore, consists 

20 of MS with the upregulated ATG expressed from the 
modified p7.5 promoter and the fusion preSl-full- 
length core expressed from the p7.5 promoter. This 
plasmid was made by digesting pHTI*-33 with BspEI and 
Sail and gel purifying the 149 bp fragment containing 

25 the 3* end of the full-length core antigen. This 

fragment was inserted into the 5995 bp vector fragment 
of pHTL-31 which had been digested with BspEI and Sail 
as well. Presence and correct orientation of the 
insert in pHTL-36 were verified by digestion of the 

3 0 plasmid with BspEX and Sail to yield a restriction map 
consistent with fragments of sizes 149 and 5995 bp. 

7. CONSTRUCTION OF RECOMBINANT 
TK" VACCINIA VIRUSES 

35 Recombinant vaccinia viruses, for vaccine 

use, were made by infecting Vero cells (ATCC Accession 
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No. CCL 81) with wild-type vaccinia virus (New York 
City Department of Health Laboratories vaccine strain, 
prepared by Wyeth) , and subsequently transf ecting the 
infected cells with a plasmid containing the genes to 
5 be inserted flanked by sequences of the vaccinia TK 
gene, within which homologous • recombination occurred. 
Transf ections were performed by one of several methods 
described in Section 5.3, supra. Recombinant viruses 
were selected by plating on Rat2 cells (ATCC Accession 

10 No. CKL 1764) in the presence of BUDR. Each 
recombinant was purified by three rounds of 
purification from a single viral plaque, and the 
identity and purity of each viral stock were verified 
by commercially available immunoassays for S and e 

15 (Abbott Laboratories) , and by Southern and western 
analyses as described in sections below. 

8. ANALYSIS OF RECOMBINANT VIRUSES 



20 8.1. SOUTHERN ANALYSIS 

A conventional Southern blotting procedure 
(Southern, 1975, J. Mol. Biol. 98:503) was used to 
verify the identity and purity of the recombinant 
viruses which had been plaque-purified. Viral and 

25 cellular DNA was isolated using a miniprep procedure 

from a loo mm dish of infected host cells. Cells were 
scraped from the dish into a 15 ml centrifuge tube, 
spun to pellet, washed in phosphate buffered saline 
containing 1 mM MgCl 2 (PBS-M; PBS = 137 raM NaCl, 3 mM 

30 KC1, 8 mM Na 2 HP0 4 , 1.5 mM KH 2 P0 4 , pH 7.5), respun and 
resuspended in 0.6 ml PBS-M. Cells were transferred 
to a 1.5 ml microcentrifuge tube and the following 
were added: 30 /il of 10% Triton X-100, 1.5 /zl of 
j8-mercaptoethanol, and 48 /il of 250 mM EDTA (pH 8.0). 

35 The tubes were vortexed and incubated on ice for 10 



17:57:02 



WO 94/12617 PCT/US93/11474 

- 98 - 

minutes with occasional vortexing . Tubes were then 
spun 800 x g 3 minutes to remove cellular material, 
the supernatant was transferred to a fresh tube and 
spun 16,000 x g for 10 minutes to pellet viral cores. 
5 The pellet of this second spin was resuspended in 100 
Ml TE (10 mM Tris, 1 mM EDTA, pH 8.0) and the 
following were added: 1.5 /il 10 mg/ml proteinase K, 
6.7 /il 3 M NaCl, 10 fxl 10% SDS, and 0.3 pi 
jS-mercaptoethanol . Tubes were mixed gently and 
10 incubated 30 minutes at 55 *C. Samples were then 

extracted twice with an equal volume 1:1 phenol :CHC1 3 
without vortexing, and ethanol precipitated twice by 
addition of 4 /il 3 M NaCl and 2.5 volumes ethanol. 
Precipitated DNA was resuspended in 20 /il H 2 0. 
15 An aliquot of this DNA was then digested 

with a restriction enzyme such as Hindlll, size 
fractionated by electrophoresis on an agarose gel with 
appropriate markers, and the gel stained and 
photographed. The DNA in the gel was then transferred 
2 0 to a nylon membrane (Hybond N, Amersham Corp., 

Arlington Heights, IL) by capillary action. After the 
transferred DNA was baked onto the membrane, the 
membrane was prehybridized and hybridized to a labeled 
nucleotide probe. To probe for the S antigen, a 836 
25 bp fragment of the plasmid pT7T3/S which had been 

digested with Hindlll and EcoRI was gel-purified and 
nonradioactively labeled by commercially available 
means (Genius Nonradioactive Labeling and Detection 
System; Boehringer Mannheim, Indianapolis, IN) , in 
30 which the DNA was labeled in a random-primed 

incorporation reaction by the introduction of dUTP 
which had been conjugated to the steroid hapten 
digoxigenin. Following hybridization with the labeled 
probe, the blot was washed and the hybridized probe 
35 visualized. Specifically, an alkaline 
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phosphatase -conjugated antibody to digoxigenin was 
added and a chemi luminescent alkaline phosphate * 
substrate (Lumiphos, Boehringer Mannheim, 
Indianapolis, IN) was used to visualize the bound 
5 antibody on X-ray film. 



for hybridization to probes for the S, preS2, preSl, 
and core regions, made by techniques analogous to 
those described supra, from plasmids described supra 

10 or others; the identity of the recombinants was thus 
verified. A probe hybridizing within the TK gene of 
vaccinia virus was also used in assessing the identity 
and purity of recombinants. Such a probe was made by 
cutting pGS53 with XbaX and EcoRI to excise the TK 

15 region, gel-purifying the 630 bp fragment, and 
labeling this fragment as described supra. 
Contamination with wild-type vaccinia virus is ruled 
out by hybridization with such a TK-specific probe, 
which gives a unique banding pattern for each 

2 0 recombinant or wild- type virus. 



25 western analysis technique (Burnette, 1981, Anal. 
Biochem. 112:195), using antibodies to the various 
hepatitis B antigens as probes. 



intracellular proteins probed with antibody to the S 
30 region, proteins expressed transiently from 

recombinant plasmids in infected cells (Rose et al. , 
supra) or expressed in cells infected with recombinant 
viruses were isolated. The procedure for harvesting 
samples from recombinant- infected cells was as 
35 follows. Infected cells were scraped off a 100 mm 



DNA from recombinant viruses was analyzed 




Specifically, to perform a western blot of 
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dish into a 15 ml centrifuge tube, collected by 
centrifugation, washed with IX PBS, resuspended in 1 
ml IX PBS f and disrupted by 3 cycles of freeze/ thaw. 
Samples were then sonicated on an ice/H 2 0 bath, 
5 aliquotted and stored at -70 'C. 60 pi of this sample 
was mixed with 20 /il 4X Laemmli buffer (125 mM Tris pH 
6.8, 20% (v/v) glycerol, 4% (w/v) SDS, 10% (v/v) 
0-mercaptoethanol, 0.0025% (w/v) bromophenol blue), 
boiled 5 minutes, loaded on a 15% SDS/polyacrylamide 

10 gel along with appropriate size standards, and 
subjected to electrophoresis overnight at 50 V 
(Laemmli, 1970, Nature 227:680). From each 100 mm 
dish transient expression experiment, pellets were 
resuspended in 250 pi IX PBS and 50 fil of this sample 

15 was mixed with 17 /il 4X Laemmli buffer. 

The separated proteins were transferred 
(Protean II apparatus and Trans-blot Cell, BIORAD 
Laboratories, Richmond, CA) to nitrocellulose 
(Schleicher & Schuell, Keene, NH) at 70 V for 2 hours 

20 in 25 mM Tris pH 8.3, 192 mM glycine, 20% methanol. 
The membrane containing the transferred proteins was 
blocked with blocking buffer (2% nonfat dry milk in 
H 2 0) for 1 hour at room temperature and then incubated 
overnight at room temperature in 25 ml blocking buffer 

25 containing 1/250 rabbit anti-HBsAg polyclonal antibody 
(cat. no. KM63P, Accurate Chemical & Scientific Corp., 
Westbury, NY) . The blot was then washed 3 X 10 
minutes in 0.1% Tween 20, 100 mM Tris pH 7.5, 1.5 M 
NaCl, 0.02% J*aN 3 . Next, a goat anti-rabbit IgG 

30 alkaline phosphatase-conjugated second antibody (cat. 
no, 8612-0081, Cappel Research Products, Durham, NC) 
was diluted 1/10,000 in blocking buffer and incubated 
with the blot for 2 hours at room temperature. The 
blot was then washed 3 X 10 minutes as above. Color 

35 was developed by addition of alkaline phosphatase 
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substrate (NBT/BCIP, Promega Protoblot System, 
Promega, Madison, WI) in alkaline phosphatase buffer 
(100 mM Tris pH 9.5, 100 mM NaCl, 5 mM MgCl 2 ) . 

A representative Western blot of 
5 polypeptides in the cell pellet of a number of 

recombinant viruses probed with anti-S antibodies is 
shown in Figure 91. It is apparent from this analysis 
and other similar gels that in* some constructs 
designed to express the MS antigen, significant 

10 expression of a polypeptide of the correct size for S 
occurs as well, and that the relative distribution of 
MS and S varies among the constructs. In the 
constructs represented in this western blot, no care 
was taken to match the sequence upstream of the 

15 initiating ATG of MS to the Kozak consensus sequence 
for ribosome binding described supra. Analysis of 
constructs such as pHTI>-10M in which the consensus 
sequence had been matched did indicate a higher MS to 
S ratio. The smaller surface proteins, MS and S, were 

20 also seen in some of the LS-producing viruses, 

particularly pGS52/Sl. A similar approach can be 
taken in the I*S constructs to try to upregulate the LS 
initiation codon. 

With respect to two independent recombinant 

25 vaccinia viruses derived from pHTL-28 (encoding 
core-S* operably linked to modified p7.5) and 
purified, the production of e sequences was detected 
by enzyme-linked immunoassay (ELISA) procedures using 
anti-e antibody, but no protein products reactive with 

30 anti-S antibody were detectable by ELISA. Southern 
analysis of the viral DNA confirmed the presence of 
' both core and S region sequences- We also performed 

western analyses on the proteirts isolated from cells 
infected with the purified recombinants. Both the 

35 anti-core and anti-S antibodies appeared to detect 



17:57:02 



WO 94/12617 PCI7US93/11474 

- 102 - 



weakly the same two bands in each blot, the larger of 
which was about the expected size for the core-S* 
fusion protein. This, suggests that these recombinant 
viruses do make some form of fusion, but that perhaps 
5 the epitopes are in an unusual conformation and are 
harder to detect. We conclude that perhaps when we 
make a fusion to core of the piece of S contained in 
pHTL-18 (which should include the dominant a epitope) , 
the conformation of S is changed such that it is no 
10 longer detectable, or at least not easily detectable; 
with the standard anti-S antibodies* 

8.3. IMMUNOASSAYS FOR DETECTION OF ANTIBODIES 
TO THE HBV ANTIGENS AND TO VACCINIA VIRUS 

Enzyme linked immunoassay (EIjISA) procedures 

were used to detect antibodies directed to vaccinia 

virus coat proteins and to HBV S, preSl, preS2 and 

core regions in the serum of mice which had been 

immunized with recombinant viruses. The amount of 

2Q HBV-specific antibody present in test serum was 
quantified by interpolation from a standard curve 
established using serial dilutions of a monoclonal 
antibody of known titer specific for HBV S, preSl, 
preS2, or core regions. 

25 Microtiter plates were coated with purified 

wild- type vaccinia virus, core protein, plasma-derived 
HBsAg , a synthetic peptide derived from preSl, or a 
synthetic peptide derived from preS2. After coating 
with antigen, the microtiter plates were rinsed with 

30 water and then blocked with non-fat milk. Plates were 
then washed with a Tris buffer (10 mM Tris pH 7.5, 
0.15 M NaCl, 0.5% Tween-20) . 

50 til of serum sample or antibody standard 
were then added to triplicate antigen-coated, blocked 

35 wells. The serum dilution used was 1:100 in 

Tris/saline (10 mM Tris pH 7.5, 150 mM NaCl) plus 1% 
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fetal bovine serum. The reaction was allowed to 
proceed for 90 minutes at 37 *C, and the plates were 
washed to remove unbound antibody. Antigen-bound 
antibody was detected by the addition of polyclonal 
5 goat anti-mouse antibody conjugated with alkaline 

phosphatase. After incubation for 60 minutes at 37 "c, 
the plates were washed to remove unbound conjugate and 
the color reaction was developed. After 15 to 30 
minutes, dilute sodium hydroxide was added to stop the 

10 reaction and the absorbance at 405 nm was read in an 
EI*ISA photometer. The best-fit polynomial equation 
was calculated for the standards for each assay , and 
this curve was then used to solve for the 
concentration of antibody associated with each OD^ 

15 value for the mouse serum samples* 

8.4. PRELIMIN ARY DOSE AND STRAIN ANALYSES 

Mice are often used in initial attempts to 
assess the immunogenicity of potential vaccines. The 
2 0 immune response of mice to the hepatitis B surface 
antigens is known to vary depending on the H-2 
haplotype of the strain (Milich et al., 1984 , J. Exp. 
Med. 159:41; Milich et al., 1985, Proc. Natl. Acad. 
Sci. USA 82:8168). However , we were unable to find 
25 any published data on the effect of haplotype on the 
response of mice to hepatitis B antigens when 
expressed from vaccinia virus. We therefore performed 
an experiment to assess the importance of the mouse 
strain and dose used. Seven week old female mice of 
30 either C57BL/6 or BALB/c strain were inoculated 

intraperitoneal ly with 0.5 ml of a 1:1 mixture of 
recombinant viruses made from pGS53/S2 and pRO-18. 
The combination of these two viruses should result in 
the expression of all of the following epitopes: 
35 core, preSl, preS2 , and S; other combinations of 
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viruses could also have been used. Total doses of 
recombinant vaccinia virus of 1 X 10 7 or 1 X 10 s plaque 
forming units (pfu, as determined by titration on Vero 
cells) were tested in two separate groups (all groups 
5 had 10 mice) . Two groups of control animals received 
equivalent doses of wild-type virus. Mice were bled 
weekly for 16 weeks and pooled sera from each group 
were tested for antibodies to the S and core antigens 
using the immunoassays described in example Section 
10 8.3, above. Anti-vaccinia responses were seen in all 
vaccinia-inoculated mice. Strong anti-S responses 
were seen in both groups of mice which received the 
mixture of recombinants at 1 X 10* pfu, with the 
response in the BALB/c mice comparable to that 
15 generated in response to 1 fxg of the Merck Recombivax 
vaccine, and the anti-S response in the C57BL/6 
slightly lower in magnitude. Anti-core responses were 
seen in both groups of mice which had been inoculated 
with the recombinant viruses at 1 X 10 R , as well. 
2 0 Anti-preS2 and weak anti-preSl responses were seen in 
the BALB/c mice at l X 10 8 pfu of recombinant virus 
mixture, as well. Based on this experiment, BALB/c 
mice were chosen as the better strain in which to 
pursue further immunogenicity studies. It is also 
25 desirable to test the ability of the viruses described 
supra to overcome nonresponsiveness to S in mouse 
strains which are genetic S-nonresponders (for example 
the H-2 S haplotype) . Since no suitable congenic strain 
to BALB/c with an H-2 S haplotype was readily available, 
30 an additional experiment testing the dose response of 
A. BY mice (H-2 b ) was undertaken. These mice, for which 
a suitable H-2 S congenic is available (A.SW) were 
tested for tolerance of vaccinia virus and for 
antibody response to the S-expressing recombinant 
35 virus made from pGS53/S at doses of 5 x 10 8 and 1 x 10 9 
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pfu. A good anti-S response was observed at both 
doses, so that this strain can be used to test the 
ability of recombinant viruses to overcome 
S-nonresponsiveness in mice. 

5 

8 .5. IMMUNOGENICITY ASSESSMENT IN MTCE 

Once a mouse strain and dose of virus have 
been chosen, the immunogenicity of a number of 
individual viruses and combinations of viruses are 
10 compared in mice. The goal is to discover the 

combination of antigens which gives the best immune 
response to all the regions (core, preSl, preS2, and 
S) . The viruses/ combinations of viruses that are 
tested are as follows (other combinations can also be 

15 used; the ratio of one virus to the other can be 

varied to change the ratio of antigens expressed in 
each candidate vaccine) . Doses of vaccinia virus in 
the total range of 10* to 10* total pfu are tested. 
Group 1 receives a mixture of viruses (1:1 or other) 

20 made from pRO— 18 (containing promoter-antigen 
combinations p7.5-LS & pll-coreA8) and pGS53/S2 
(p7 . 5-MS) . Group 2 receives a virus made from pHTI*-30 
(p7.5-coreA8 & modified p7.5-MS), and group 3 receives 
a virus made from pHTL-31 (p7 . 5-preSl-coreA8 and 

25 modified p7.5-MS). Group 4 receives a mixture of 

viruses made from pHTL-26 (modified p7.5-core-preSl*) , 
pHTI#-27 (modif ied p7 . 5-core-preS2 ) , and pHTL-8 
(modified p7.5-S) . Group 5 receives a mixture of 
viruses made from pHTL-26 and pHTL-lOM (modified 

3 0 p7*5-MS). Group 6 receives a mixture of viruses made 
from pHTL-27 and pHTL-8. Group 7 receives a virus 
made from pHTL-35 (modified p7,5-I*S and p7.5-full- 
length core) . Control groups receive viruses from 
plasmids pHTL-32 (p7 . 5-preSl-coreA8) , pHTL-lOM, 

35 pHTL-8, and pSCIO (pll-/3-galactosidase) , respectively. 
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Additional control groups separately receive peptides, 
such as those described for preSl and preS2 in Example 
8.3, above, (or in the case of S, the S-only vaccine, 
Merck's Recombivax, the whole S protein) corresponding 
to the immunodominant epitopes of S, preS2, preSl, and 
core in the presence of adjuvant such as alum. 

As in the earlier dose and strain 
experiments, the immune response of the mice to these 
various combinations of antigens are assessed by use ; 
of the immunoassays described above. One or several 
viruses or combinations of viruses are chosen for 
further study based on the strength of the anti-HBV 
immune response which they generate. Further analysis 
can include testing in strains of mice which are known 
to lack the ability to produce an antibody response to 
S antigen by itself (H-2* mice) , to see if the addition 
of other HBV epitopes such as preSl, preS2 and core 
can overcome S-nonresponsiveness in these mice. These 
same viruses can be analyzed in chimpanzees for the 
ability to protect the animals from challenge with 
hepatitis B virus. A vaccine which proves efficacious 
in chimpanzees can be tested in human clinical trials. 

9. PRELIMINARY RESULTS USING INSERTION 
SITES IN VACCINIA NON-CODING REGIONS 

Three plasmid insertion vectors containing 

HBV sequences flanked by vaccinia non-coding regions 

were constructed, for targeting insertion into the 

F14L-F15L, C12L-C11R, and A53R-A55R regions of the 

30 vaccinia genome, respectively. When two of the 

constructs (regions F14L-F15L and C12L-C11R, 

respectively) were used in attempts to generate 

recombinant vaccinia viruses, no recombinants were 

obtained out of 10,000 viruses screened (using the 

35 blue-white screening method based on /3-galactoside 

expression) . These recombinants would have been 
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identifiable because they form blue plaques when 
overlaid with agarose containing X-gal. With the 
third construct (A53R— A55R) , three blue recombinants 
were found out of 10,000 screened, but when we 
attempted to purify these viruses by picking the blue 
plaques, growing up a small stock, plating again and 
overlaying again, we* found that no blue plaques 
remained. We tentatively conclude that the 
recombinant was either unstable or at a severe growth 
disadvantage relative to the non-recombinant virus. 
However, the strategy we used to select these sites 
can be extended to other noncoding areas. The 
frequency of recombinants might be increased by 
increasing the length of the areas of flanking 
vaccinia DNA in the plasmid, perhaps up to about 500 
bp on each side. Use of such large flanking regions 
may necessitate inclusion of some DNA in these 
flanking regions from the nonessential genes 
surrounding the targeted insertion site. 

10. DEPOSIT OF MICROORGANISMS 
Bacteria carrying plasmid pHTL— 31 encoding 
HBV preSi-core(A8) operably linked to the p7.5 
promoter, and HBV MS (with an upregulated ATG) 
operably linked to the modified p7.5 promoter were 
deposited on November 12, 1992 with the American Type 
Culture Collection (ATCC) , 1201 Park lawn Drive, 
Rockville, Maryland 20852, under the provisions of the 
Budapest Treaty on the International Recognition of 
the Deposit of Microorganisms for the Purposes of 
Patent Procedures, and assigned accession no. 69124. 

The present invention* is not to be limited 
in scope by the microorganisms deposited or the 
specific embodiments described herein. Indeed, 
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various modifications of the invention in addition to 
those described herein will become apparent to those 
skilled in the art from the foregoing description and 
accompanying figures. Such modifications are intended 
5 to fall within the scope of the appended claims. 



Various publications are cited herein, the 



disclosures of which are incorporated by reference in 
their entireties. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



APPLICANT: Souw, Peter T.S. 



0*Keefe, Rhonda W. 
Lewis # Tatyana 
Bernstine, Edward G. 



(11) TITLE OF INVENTIONS Hepatitis B Virus Vaccines 
(111) NUMBER OF SEQUENCES: 59 

(Iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York, 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC— DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: On even date herewith 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Mis rock, S. Leslie 

(B) REGISTRATION NUMBER: 18,872 

(C) REFERENCE / DOCKET NUMBER: 7441-003 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212)790*9090 

(B) TELEFAX: (212)869-8864-9741 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Pro Arg Arg Arg Arg Ser Gin Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION! SEQ ID NO: 2: 
ATAAATAATA AATACAATAA TTAATTTCTC GTAAAAGTAG AAAATATATT CTAATTTATT 60 
GCAC 64 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGY i unknown 

(ii) MOLECULE TOPE: DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 
CTAGAAGCGA TGCTACGCTA GTCACAATCA CCACTTTCAT ATTTAGAATA TATGTATGTA 60 
AAAAT AT AG T AGAATTTCAT TTTGTTTTTT TCTATGCTAT AAAT 104 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 nucleotides 

(B) TYPE: nucleic acid 
<C) STRAND ED NESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AAAAATTGAA ATTTTATTTT TTTTTTTTGG AATATAAATA AG 42 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY r unknown 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1170 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 5: 

ATG GGG ACG AAT CTT TCT GTT CCC AAT CCT CTG GGA TTC TTT CCC GAT 48 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

CAT CAG TTG GAC CCT GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT TGG 96 
His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Aan Asn Pro Asp Trp 
20 25 30 

GAC TTC AAC CCC ATC AAG GAC CAC TGG CCA GCA GCC AAC CAG GTA GGA 144 
Asp Phe Asn Pro lie Lys Asp His Trp Pro Ala Ala Asn Gin Val Gly 
35 40 45 
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GTG GGA GCA TTC GGG CCA 6GG TTC ACC CCT CCA CAC GGC CGT GTT TTG 192 
Val Gly Ala Phe Gly Pro Gly Phe Thr Pro Pro His Gly Gly Val Leu 
50 55 60 

GGG TGG AGC CCT CAG GCT GAG GGC ATA TTG ACC ACA GTG TCA ACA ATT 240 
Gly Trp Ser Pro Gin Ala Gin Gly lie Leu Thr Thr Val Ser Thr lie 
65 70 75 80 

CCT CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA AGG CAG CCT ACT CCC 288 
Pro Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro 
85 90 95 

ATC TCP CCA CCT CTA AGA GAC AGT CAT CCT CAG GCC ATG CAG TGG AAC 336 
lie Ser Pro Pro Leu Arg Asp Ser His Pro Gin Ala Met Gin Trp Asn 
100 105 110 

TCC ACT GCC TTC CAC CAA GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 384 
Ser Thr Ala Phe His Gin Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 
115 120 125 

TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA ACA GTA AAC CCT GCT CCG 432 
Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr Val Asn Pro Ala Pro 
130 135 140 

AAT ATT GCC TCT CAC ATC TCG TCA ATC TCC GCG AGG ACT GGG GAC CCT 480 
Asn .lie Ala Ser His lie Ser Ser lie Ser Ala Arg Thr Gly Asp Pro 
145 150 155 160 

GTG ACG AAC ATG GAG AAC ATC ACA TCA GGA TTC CTA GGA CCC CTG CTC 528 
Val Thr Asn Met Glu Asn He Thr Ser Gly Phe Leu Gly Pro Leu Leu 
165 170 175 

GTG TTA CAG GCG GGG TTT TTC TTG TTG ACA AGA ATC CTC ACA ATA CCG 576 
Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
180 185 190 

CAG AGT CTA GAC TCG TGG TGG ACT TCT CTC AAT TTT CTA GGG GGA TCA 624 
Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ser 
195 200 205 

CCC GTG TGT CTT GGC CAA AAT TCG CAG TCC CCA ACC TCC AAT CAC TCA 672 
Pro Val Cys Leu Gly Gin Asn Ser Gin ser Pro Thr Ser Asn His Ser 
210 215 220 

CCA ACC TCC TGT CCT CCA ATT TGT CCT GGT TAT CGC TGG ATG TGT CTG 720 
Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
225 230 235 240 

CGG CGT TTT ATC ATA TTC CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC 768 
Arg jucg Phe lie. He Phe Leu Phe He Leu Leu Leu Cys Leu lie Phe 
245 250 255 

TTA TTG GTT CTT CTG GAT TAT CAA GGT ATG TTG CCC GTT TGT CCT CTA 816 
Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu 
260 265 270 

ATT CCA GGA TCA ACA ACA ACC AGT ACG GGA CCA TGC AAA ACC TGC ACG 864 
He Pro Gly Ser Thr Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
275 280 285 

ACT CCT GCT CAA GGC AAC TCT ATG TTT CCC TCA TGT TGC TGT ACA AAA 912 
Thr Pro Ala Gin Gly Asn Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
290 295 300 

CCT ACG GAT GGA AAT TGC ACC TGT ATT CCC ATC CCA TCG TCT TGG GCT 960 
Pro Thr Asp Gly Asn Cys Thr Cys He Pro He Pro Ser Ser Trp Ala 
305 310 315 320 

TTC GCA AAA TAC CTA TGG GAG TGG GCC TCA GTC CGT TTC TCT TGG CTC 1008 
Phe Ala Lys Tyr Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu 
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325 330 335 

AGT TTA CTA GTG CCA TTT GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT 1056 
Ser Leu Leu Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr 
340 345 350 

GTT TGG CTT TCA GCT ATA TGG ATG JlTG TGG TAT TGG GGG CCA AGT CTG 1104 
Val Trp Leu Ser Ala lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
355 360 365 

TAC AGC ATC GTG AGT CCC TTT ATA CCG CTG TTA CCA ATT TTC TTT TGT 1152 
Tyr Ser lie Val Ser Pro Phe He Pro Leu Leu Pro lie Phe Phe Cys 
370 375 380 

CTC TGG GTA TAC ATT TAA 1170 
Leu Trp Val Tyr He 
385 390 

(2) INFORMATION FOR SEQ ID NO: 6s 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro lie Lys Asp Hie Trp Pro Ala Ala Asn Gin Val Gly 
35 40 45 

Val Gly Ala Phe Gly Pro Gly Phe Thr Pro Pro His Gly Gly Val Leu 
50 55 60 

Gly Trp Ser Pro Gin Ala Gin Gly lie Leu Thr Thr Val Ser Thr lie 
65 70 75 80 

Pro Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro 
85 90 95 

lie Ser Pro Pro Leu Arg Asp Ser His Pro Gin Ala Met Gin Trp Asn 
100 105 110 

Ser Thr Ala Phe His Gin Ala Leu Gin Asp Pro Arg Val Arg Gly Leu 
115 120 125 

Tyr Phe Pro Ala Gly Gly Ser Ser Sex Gly Thr Val Asn Pro Ala Pro 
130 135 140 

Asn lie Ala Ser His lie Ser Ser lie Ser Ala Arg Thr Gly Asp Pro 
145 150 155 160 

Val Thr Asn Met Glu Asn lie Thr Ser Gly Phe Leu Gly Pro Leu Leu 
165 170 175 

Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
180 185 190 

Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ser 
195 200 205 

Pro Val Cys Leu Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser 
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210 215 220 

Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
225 230 235 240 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
245 250 255 

Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu 
260 265 270 

lie Pro Gly Ser Thr Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
275 280 285 

Thr Pro Ala Gin Gly Asn Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
290 295 300 

Pro Thr Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 
305 310 315 320 

Phe Ala Lys Tyr Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu 
325 330 335 

Ser Leu Leu Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr 
340 345 350 

Val Trp Leu Ser Ala lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
355 360 365 

Tyr ser lie Val Ser Pro Phe lie Pro Leu Leu Pro lie Phe Phe Cys 
370 375 380 

Leu Trp Val Tyr lie 
385 



(2) INFORMATION FOR SBQ ZD MO: 7: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATION: 1..846 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG CAG TGG AAC TCC ACT GCC TTC CAC CAA GCT CTG CAG GAT CCC AGA 48 
Met Gin Trp Asn Ser Thr Ala Phe His Gin Ala Leu Gin Asp Pro Arg 
15 10 15 

GTC AGG GGT CTG TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA AGA GTA 96 
Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr Val 
20 25 30 

AAC CCT GCT CCG AAT ATT GCC TCT CAC ATC TOG TCA ATC TCC GCG AGG 144 
Asn Pro Ala Pro Asn lie Ala Ser His lie Ser Ser lie Ser Ala Arg 
35 40 45 

ACT GGG GAC CCT GTG ACG AAC ATG GAG AAC ATC ACA TCA GGA TTC CTA 192 
Thr Gly Asp Pro Val Thr Asn Met Glu Asn lie Thr Ser Gly Phe Leu 
50 55 60 
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GGA CCC CTG CTC GTG TTA CAG GCG GGG TTT TTC TTG TTG ACA AGA ATC 240 

Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie 
65 70 75 80 

CTC ACA ATA CCG CAG AGT CTA GAC TCG TGG TGG ACT TCT CTC AAT TTT 288 
Leu Thr He Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe 
85 90 95 

CTA GGG GGA TCA CCC GTG TGT CTT GGC CAA AAT TCG CAG TCC CCA ACC 336 
Leu Gly Gly Ser Pro Val Cys Leu Gly Gin Asn Ser Gin Ser Pro Thr 
100 105 110 

TCC AAT CAC TCA CCA ACC TCC TGT CCT CCA ATT TGT CCT GGT TAT CGC 384 
Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg 
115 120 125 

TGG ATG TGT CTG CGG CGT TTT ATC ATA TTC CTC TTC ATC CTG CTG CTA 432 
Trp Met Cys Leu Arg Arg Phe He lie Phe Leu Phe lie Leu Leu Leu 
130 135 140 

TGC CTC ATC TTC TTA TTG GTT CTT CTG GAT TAT CAA GGT ATG TTG CCC 480 
Cys Leu lie Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro 
145 150 155 160 

GTT TGT CCT CTA ATT CCA GGA TCA ACA ACA ACC AGT ACG GGA CCA TGC 528 
Val • Cys Pro Leu lie Pro Gly Ser Thr Thr Thr Ser Thr Gly Pro Cys 
165 170 175 

AAA ACC TGC ACG ACT CCT GCT CAA GGC AAC TCT ATG TTT CCC TCA TGT 576 
Lys Thr Cys Thr Thr Pro Ala Gin Gly Asn Ser Met Phe Pro Ser Cys 
180 185 190 

TGC TGT ACA AAA CCT ACG GAT GGA AAT TGC ACC TGT ATT CCC ATC CCA 624 
Cys Cys Thr Lys Pro Thr Asp Gly Asn Cys Thr Cys lie Pro lie Pro 
195 200 205 

TCG TCT TGG GCT TTC GGA AAA TAC CTA TGG GAG TGG GCC TCA GTC CGT 672 
Ser Ser Trp Ala Phe Ala Lys Tyr Leu Trp Glu Trp Ala Ser Val Arg 
210 215 220 

TTC TCT TGG CTC AGT TTA CTA GTG CCA TTT GTT CAG TGG TTC GTA GGG 720 
Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin Trp Phe Val Gly 
225 230 235 240 

* CTT TCC CCC ACT GTT TGG CTT TCA GCT ATA TGG ATG ATG TGG TAT TGG 768 
Leu Ser Pro Thr Val Trp Leu Ser Ala lie Trp Met Met Trp Tyr Trp 
245 250 255 

GGG CCA AGT CTG TAC AGC ATC GTG AGT CCC TTT ATA CCG CTG TTA CCA 816 
Gly Pro Ser Leu Tyr Ser lie Val Ser Pro Phe lie Pro Leu Leu Pro 
260 265 270 

ATT TTC TTT TGT CTC TGG GTA TAC ATT TAA 846 
lie Phe Phe Cys Leu Trp Val Tyr lie 
275 280 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii> MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Gin Trp Asn Ser Thr Ala Phe His Gin Ala Leu Gin Asp Pro Arg 
15 10 15 
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Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr Val 
20 25 30 

Aan Pro Ala Pro Asn lie Ala Ser His He Ser Ser He Ser Ala Arg 
35 40 45 

Thr Gly Asp Pro Val Thr Asn Met Glu Asn lie Thr Ser Gly Phe Leu 
50 55 60 

Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg He 
65 70 75 80 

Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe 
85 90 95 

Leu Gly Gly Ser Pro Val Cys Leu Gly Gin Asn Ser Gin Ser Pro Thr 
100 105 HO 

Ser Asn HpLs Ser Pro Thr Ser* cys Pro Pro He Cys Pro Gly Tyr Arg 
115 120 125 

Trp Met Cys Leu Arg Arg Phe He He Phe Leu Phe He Leu Leu Leu 
130 135 140 

Cys Leu He Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met: Leu Pro 
145 ISO 155 160 

Val Cys Pro Leu He Pro Gly Ser Thr Thr Thr Ser Thr Gly Pro cys 
165 170 175 

Lys Thr Cys Thr Thr Pro Ala Gin Gly Asn Ser Met Phe Pro Ser Cys 
180 185 190 

Cys Cys Thr Lys Pro Thr Asp Gly Asn Cys Thr Cys He Pro He Pro 
195 200 205 

Ser Ser Trp Ala Phe Ala Lys Tyr Leu Trp Glu Trp Ala Ser Val Arg 
210 215 220 

Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin Trp Phe Val Gly 
225 230 235 240 

Leu Ser Pro Thr Val Trp Leu Ser Ala He Trp Met Met Trp Tyr Trp 
245 250 255 

Gly Pro Ser Leu Tyr Ser He Val Ser Pro Phe He Pro Leu Leu Pro 
260 265 270 

He Phe Phe Cys Leu Trp Val Tyr He 
275 280 

(2) INFORMATION FOR SEQ ID NO: 9: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 681 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(li) MOLECULE TYPE: DNA 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..681 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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ATG GAG AAC ATC ACA TCA GGA TTC CTA GGA CCC CTG CTC GTG TTA CAG 48 
Mat Glu Asn He Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
15 10 15 

GCG GGG TTT TTC TTG TTG ACA AGA ATC CTC ACA ATA CCG CAG AGT CTA 96 
Ala Gly Phe Phe Leu Leu Thr Arg He Leu Thr He Pro Gin Ser Leu 
20 25 30 

GAC TCG TGG TGG ACT TCT CTC AAT TTT CTA GGG GGA TCA CCC GTG TGT 144 
Asp Ser Trp Trp Thr Ser Leu Aon Phe Leu Gly Gly Ser Pro Val Cys 
35 40 45 

CTT GGC CAA AAT TCG CAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCC 192 
Leu Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
50 55 60 

TGT CCT CCA ATT TGT CCT GGT TAT CGC TGG ATG TGT CTG CGG CGT TTT 240 
Cys Pro Pro He Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
65 70 75 80 

ATC ATA TTC CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC TTA TTG GTT 288 
He He Phe Leu Phe He Leu Leu Leu Cys Leu He Phe Leu Leu Val 
85 90 95 

CTT CTG GAT TAT CAA GGT ATG TTG CCC GTT TGT CCT CTA ATT CCA GGA 336 
Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu He Pro Gly 
100 105 110 

TCA ACA ACA ACC AGT ACG GGA CCA TGC AAA ACC TGC ACG ACT CCT GCT 384 
Ser Thr Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr Thr Pro Ala 
115 120 125 

CAA GGC AAC TCT ATG TTT CCC TCA TGT TGC TGT ACA AAA CCT ACG GAT 432 
Gin Gly Asn Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Thr Asp 
130 135 140 

GGA AAT TGC ACC TGT ATT CCC ATC CCA TCG TCT TGG GCT TTC GCA AAA 480 
Gly Asn Cys Thr Cys He Pro He Pro Ser Ser Trp Ala Phe Ala Lys 
145 150 155 160 

TAC CTA TGG GAG TGG GCC TCA GTC CGT TTC TCT TGG CTC AGT TTA CTA 528 
Tyr Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu 
165 170 175 

GTG CCA TTT GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT 576 
Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu 
180 185 190 

TCA GCT ATA TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG TAC AGC ATC 624 
-Ser A la. He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He 
195 200 205 

GTG AGT CCC TTT ATA CCG CTG TTA CCA ATT TTC TTT TGT CTC TGG GTA 672 
Val Ser Pro Phe He Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val 
210 215 220 

TAC ATT TAA 681 

Tyr He 

225 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NOrlO: 

Met Clu Aon lie Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
15 10 15 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr He Pro Gin Ser Leu- 
20 25 30 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ser Pro Val Cys 
35 40 45 

Leu Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn Hie Ser Pro Thr Ser 
50 55 6 ? 

Cys Pro Pro He Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
65 70 75 80 

He He Phe Leu Phe He Leu Leu Leu Cye Leu He Phe Leu Leu Val 
85 90 95 

Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu He Pro Gly 
100 105 110 

Ser Thr Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr Thr Pro Ala 
115 120 i25 

Gin Gly Asn Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Thr Asp 
130 135 140 

Gly Asn Cys Thr Cys He Pro He Pro Ser Ser Trp Ala Phe Ala Lys 
145 150 155 160 

Tyr Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu 
165 170 175 

Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu 
180 185 190 

Ser Ala He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Ser He 
195 200 205 

Val Ser Pro Phe He Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val 
210 215 220 

Tyr He 
225 

(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 nucleotides 

(B) TYPE: nucleic acid 

(C) SRANDEDNESS: Single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 11: 
GGAGGATCCA GCATCAAGG 19 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 534 nucleotides 

(B ) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,.534 



(xi) SEQUENCE DESCRIPTION I SEQ ZD NO: 12: 

ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAC ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
50 55 60 

TTG ATG ACT CTA GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA GCA 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Asn Aan Leu Glu Asp Pro Ala 
65 70 75 80 

TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA 288 
Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

ATT AGG CAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 336 
He Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

CCT CCA GCC TAT AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

GAA ACT ACT GTT GTT AGA CGA CGG GAC CGA GGC AGG TCC CCT AGA AGA 480 
Glu Thr Thr Val Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg 
145 150 155 160 

AGA ACT CCC TCG CCT CGC AGA CGC AGA TCT CAA TCT CGG GAA TCT CAA 528 
Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg Glu Ser Gin 
165 170 175 

TGT TAG 534 
Cys 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Asp lie Asp Pro Tyr Lye Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe frro Ser Val Arg Aep Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His Bis Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Aen Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Aen Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

He Arg Gin Leu Leu Trp Phe Bis He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

* Glu Thr Thr Val Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg 
145 150 155 160 

Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Arg Glu Ser Gin 
165 170 175 

Cys 



(2) INFORMATION FOR SEQ ID NO: 14 : 

(1) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 558 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 
.(B) LOCATION: 1..558 

(xl) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 
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TCA CCT CAC CAC ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

TTG ATG ACT CTA GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA GCA 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA 288 
Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lye 
85 90 95 

ATT AGG CAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 336 
lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

CCT CCA GCC TAT AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 

GAA ACT ACT GTT GTT AGA OGA CGG GAC CGA GGC AGG TCC CCT AGA AGA 480 
Glu Thr Thr Val Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg 
145 150 155 160 

AGA ACT CCC TCG CCT CGC AGA CGC AGA TCC CAA TCG CCG CGT CGC AGA 528 
Arg Thr Pro Ser Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg 
165 170 175 

CGA TCT CAA TCT CGG GAA TCT CAA TGT TAG 558 
Arg Ser Gin Ser Arg Glu Ser Gin Cys 
180 185 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
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115 



120 



125 



Pro Pro Ala Tyr 
130 



Arg Pro 



Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
135 140 



Glu Thr Thr Val 
145 



Val Arg 
150 



Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg 
155 160 



Arg Thr Pro Ser 



Pro Arg 
165 



Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg 
170 175 



Arg Ser Gin Ser 



Arg Glu 



Ser Gin Cyo 
185 



180 



<2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GATCCCAATC GCGGCGTCGC AGAC 24 
(2) INFORMATION FOR SEQ ID NO: 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GATCGTCTGC GACGCGGCGA TTGG 24 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 10 nucleotides 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCCTCGAGGG 10 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 19: 
CTCACTCGAG GGAATAAGCT 
(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRAND ED NES S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTGTTAGGCC TTAAATGTAT 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 nucleotides 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION x SEQ ID NO:21: 
GAGCTCGGTA CCACCATGAA GTGGA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAGCAGGGGT CCTAGGAATC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



17:57:02 



WO 94/12617 PCT/US93/11474 

- 123 - 

TCACTATCCG GAATCAGCTT 20 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: DNA 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:24x 
GGTGAATTCT GGCCCGAATG 20 
(2) INFORMATION FOR SEQ ID NO: 25: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..627 

(xl) SEQUENCE DESCRIPTION: SEQ ZD NO: 25* 

ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTG 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser yal Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAC ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
SO 55 60 

TTG ATG ACT CTA GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA GCA 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA 288 
Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

ATT AGG CAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 336 
lie Arg Gin Leu Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Vai Trp He Arg Thr 
115 120 125 

CCT CCA GCC TAT AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
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130 135 

GAA TCA GCT TGC ATG GGG ACG AAT CTT 
Glu Ser Ala Cys Met Gly Thr Asn Leu 
145 150 

TTC TTT CCC GAT CAT CAG TTG GAC CCT 
Phe Phe Pro Asp His Gin Leu Asp Pro 
165 

AAT CCA GAT TGG GAC TTC AAC CCC ATC 
Asn Pro Asp Trp Asp Phe Asn Pro lie 
180 185 

AAC CAG GTA GGA GTG GGA GCA TTC GGG 
Asn Gin Val Gly Val Gly Ala Phe Gly 
195 200 

TAA 
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140 

TCT GTT CCC AAT CCT CTG GGA 480 
Ser Val Pro Asn Pro Leu Gly 
155 160 

GCA TTC GGA GCC AAC TCA AAC 528 
Ala Phe Gly Ala Asn Ser Asn 
170 175 

AAG GAC CAC TGG CCA GCA GCC 576 
Lys Asp His Trp Pro Ala Ala 
190 

CCA GAA TTC AGG CCT ACT AGT 624 
Pro Glu Phe Arg Pro Thr Ser 
205 

627 



(2) INFORMATION FOR SEQ ID NO: 26; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 

Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Glu Ser Ala Cys Met Gly Thr Asn Leu Ser Val -Pro Asn Pro Leu Gly 
145 150 155 160 

Phe Phe Pro Asp His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn 
165 170 175 

Asn Pro Asp Trp Asp Phe Asn Pro lie Lys Asp His Trp Pro Ala Ala 
180 185 190 
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Asn Gin Val Gly Val Gly Ala Phe Gly Pro Glu Phe Arg Pro Thr Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CATCCTCCGG ACATGCAGTG 20 

(2) INFORMATION FOR SEQ ID NO: 28: 

. (i) SEQUENCE CHARACTERISTICS: 

| A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 J 
GTCCTAGGAA TTCTGATGTG 20 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 639 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(ix> FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..639 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TCG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAC ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA 192 
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Ser Pro Hie His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 6Q 

TTG ATG ACT CTA GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA CCA 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA 288 
Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

ATT AGG CAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 336 
lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr 
115 120 125 

CCT CCA GCC TAT AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro 
130 135 140 



GAC ATG CAG TGG AAC TCC ACT GCC TTC CAC CAA GCT CTG CAG GAT CCC 
Asp Met Gin Trp Asn Ser Thr Ala Phe His Gin Ala Leu Gin Asp Pro 
145 150 155 160 



480 



AGA GTC AGG GGT CTG TAT TTT CCT GCT GGT GGC TCC AGT TCA GGA ACA 528 
Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr 
165 170 175 

GTA AAC CCT GCT CCG AAT ATT GCC TCT CAC ATC TCG TCA ATC TCC GCG 576 
Val Asn Pro Ala Pro Aon He Ala Ser His lie Ser Ser lie Ser Ala 
180 185 190 

AGG ACT GGG GAC CCT GTG ACG AAC ATG GAG AAC ATC ACA TCA GAA TTC 624 
Arg Thr Gly Asp Pro Val Thr Asn Met Glu Asn lie Thr Ser Glu Phe 
195 200 205 

AGG CCT ACT AGT TAA 639 
Arg Pro Thr Ser 
210 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Asp He Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 is 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 
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ser Arg Asp Leu Val Val Asn Tyr VaX Asn Thr Asn Met Gly Leu Lya 
85 90 95 

lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu Thr Val Leu Glu Tyr Leu Val -Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Asp Met Gin Trp Asn Ser Thr Ala Phe His Gin Ala Lev Gin Asp Pro 
145 150 155 160 

Arg Val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr 
165 170 175 

Val Asn Pro Ala Pro Asn Zle Ala Ser Bis lie Ser Ser He Ser Ala 
180 185 190 

Arg Thr Gly Asp Pro Val Thr Asn Met Glu Asn lie Thr Ser Glu Phe 
195 200 205 

Arg Pro Thr Ser 
210 

(2) INFORMATION FOR SBQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 20 nucleotides 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY x unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 31s 
GTATGTTTCC GGATTGTCCT 20 
(2) INFORMATION FOR SSQ ID NO*32: 

(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTHS 20 nucleotides 

(B) TYPES nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TGAGGCCGAA TTCCATAGGT 20 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 
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( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC 48 
Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
15 10 15 

TOG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 96 
Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC 144 
Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

TCA CCT CAC CAC ACC GCA CTC AGG GAA GCC ATT CTC TGC TGG GGG GAA 192 
Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

TTG ATG ACT CTA GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA GCA 240 
Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala 
65 70 75 80 

TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA 288 
Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

ATT AGG GAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 336 
lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT 384 
Glu Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

CCT CCA GCC TAT AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG 432 
Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

GAT TGT CCT CTA ATT CCA GGA TCA ACA ACA ACC AGT ACG GGA CCA TGC 480 
Asp Cys Pro Leu lie Pro Gly Ser Thr Thr Thr Ser Thr Gly Pro Cys 
145 150 155 160 

AAA ACC TGC ACG ACT CCT GCT CAA GGC AAC TCT ATG TTT CCC TCA TGT 528 
Lys Thr Cys Thr Thr Pro Ala Gin Gly Asn Ser Met Phe Pro Ser Cys 
165 170 175 

TGC TGT ACA AAA CCT ACG GAT GGA AAT TGC ACC TGT ATT CCC ATC CCA 576 
Cys Cys Thr Lys Pro Thr Asp Gly Asn Cys Thr Cys He Pro He Pro 
180 185 190 

TCG TCT TGG GCT TTC GCA AAA TAC CTA TGG AAT TCA GGC CTA CTA GTT 624 
Ser Ser Trp Ala Phe Ala Lys Tyr Leu Trp Asn Ser Gly Leu Leu Val 
195 200 205 

AAG TAA 

L y s 630 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 209 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 34: 

Met Asp lie Asp Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu 
1 5 10 15 

Ser Phe Leu Pro Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp 
20 25 30 

Thr Ala Ser Ala Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys 
35 40 45 

Ser Pro His His Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu 
50 55 60 

Leu Met Thr Leu Ala Thr Trp Val Gly Asn Asn Leu Glu Aep Pro Ala 
65 70 75 SO 

Ser Arg Asp Leu Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys 
85 90 95 

lie Arg Gin Leu Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg 
100 105 110 

Glu -Thr Val Leu Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr 
115 120 125 

Pro Pro Ala Tyr Arg Pro Pro Asn Ala Pro lie Leu Ser Thr Leu Pro 
130 135 140 

Asp Cys Pro Leu lie Pro Gly Ser Thr Thr Thr Ser Thr Gly Pro Cys 
145 ISO 155 160 

Lys Thr Cys Thr Thr Pro Ala Gin Gly Asn Ser Met Phe Pro Ser Cys 
165 170 175 

Cys Cys Thr Lys Pro Thr Asp Gly Asn Cys Thr Cys lie Pro lie Pro 
180 185 190 

Ser Ser Trp Ala Phe Ala Lys Tyr Leu Trp Asn Ser Gly Leu Leu Val 
195 200 205 

Lys 



(2) INFORMATION FOR SEQ ID NO: 35 £ 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CGAGGAATTT ATTTATAGC 19 
(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : u n known 
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(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CTCACTCGAG GGAATAAGCT 
(2) INFORMATION FOR SEQ ID NO: 372 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPEx DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 2 
TTGTTAGGCC TTAAATGTAT 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GTGGGTAAGC TTCTCGATGT 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS 2 single 

(D) TOPOLOGY s unknown 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AGTTTCCAAG CTTATGAG 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
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GCGCGATGGA CATTGACCCT TATA 



24 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t single 
<D) TOPOLOGY: unknown 

(11) MOLECULE TYPE: DNA 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCCTGATCAC TAACATTGAG ATTCCCGA 28 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 nucleotides 

(B) TYPBs nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii> MOLECULE TYPES DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
ATTCTCGAGG GCATGGGGAC G 21 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 20 nucleotides 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS x single 
(D) TOPOLOGY r unknown 

(ii) MOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ACTCCATGGC CTGAGGATGA 20 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE* DNA 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..858 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATG GGG ACG AAT CTT TCT GTT CCC AAT CCT CTG GGA TTC TTT CCC GAT 48 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
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15 10 15 

CAT GAG TTG GAC CCT GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT TGG 96 
His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

GAC TTC AAC CCC ATC AAG GAC CAC TGG CCA GCA GCC AAC CAG GTA GGA - 144 
Asp Phe Asn Pro He Lys Asp His Trp Pro Ala Ala Asn Gin Val Gly 
35 40 45 

GTG GGA GCA TTC GGG CCA GGG TTC ACC CCT CCA CAC GGC GGT GTT TTG 192 
Val Gly Ala Phe Gly Pro Gly Phe Thr Pro Pro His Gly Gly Val Leu 
50 55 60 

GGG TGG AGC CCT CAG GCT CAG GGC ATA TTG ACC ACA GTG TCA ACA ATT 240 
Gly Trp Ser Pro Gin Ala Gin Gly lie Leu Thr Thr Val Ser Thr lie 
65 70 75 80 

CCT CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA AGG CAG CCT ACT CCC 288 
Pro Pro Pro Ala Ser Thr Asn Arg Gin ser Gly Arg Gin Pro Thr Pro 
85 90 95 

ATC TCT CCA CCT CTA AGA GAC AGT CAT CCT CAG GCC ATG GAC ATT GAC 336 
lie Ser Pro Pro Leu Arg Asp Ser His Pro Gin Ala Met Asp lie Asp 
100 105 HO 

CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC TCG TTT TTG CCT 384 
Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu Pro 
115 120 125 

TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC ACC GCC TCA GCT 432 
Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser Ala 
130 135 140 

CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC TCA CCT CAC CAC 480 
Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His 
145 150 155 160 

ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA TTG ATG ACT CTA 528 
Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr Leu 
165 170 175 

GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA GCA TCA AGG GAT CTA 576 
Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala Ser Arg Asp Leu 
180 185 190 

GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA ATT AGG CAA CTA 624 
Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys lie Arg Gin Leu 
195 200 205 

TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA GAG ACT GTA CTT 672 
Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu 
210 215 220 

GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT CCT CCA GCC TAT 720 
Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala Tyr 
225 230 235 240 

AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG GAA ACT ACT GTT 768 
Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr Val 
245 250 255 

GTT AGA CGA CGG GAC CGA GGC AGG TCC CCT AGA AGA AGA ACT CCC TCG 816 
Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser 
260 265 270 

CCT CGC AGA CGC AGA TCT CAA TCT CGG GAA TCT CAA TGT TAG 858 
Pro Arg Arg Arg Arg Ser Gin Ser Arg Glu Ser Gin Cys 
275 280 285 
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(2) INFORMATION FOR SEQ ID NO:45: 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH : 285 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY r unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Cly Phe Phe Pro Asp 
* 5 lo 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro lie Lys Asp His Trp Pro Ala Ala Asn Gin Val Gly 
35 40 45 

Val Gly Ala Phe Gly Pro Gly Phe Thr Pro Pro His Gly Gly Val Leu 
50 55 60 

Gly Trp Ser Pro Gin Ala Gin Gly lie Leu Thr Thr Val Ser Thr lie 
65 70 75 80 

Pro Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro 
85 90 95 

Zle Ser Pro Pro Leu Arg Asp Ser His Pro Gin Ala Met Asp lie Asp 
100 lOS no 

Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu Pro 
115 120 125 

Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser Ala 
130 135 140 

Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His 
145 150 155 160 

Thr Ala Leu Arg Gin Ala lie Leu Cys Trp Gly Glu Leu Met Thr Leu 
165 170 175 

Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala Ser Arg Asp Leu 
180 185 190 

Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys lie Arg Gin Leu 
195 200 205 

Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu 
210 215 220 

Glu Tyr Leu Val Ser Phe Gly Val Trp lie Arg Thr Pro Pro Ala Tyr 
225 230 235 240 

Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr Val 
245 250 255 

Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser 
260 265 270 

Pro Arg Arg Arg Arg Ser Gin Ser Arg Glu Ser Gin Cys 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 nucleotides 
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(B) TYPE j nucleic acid 
<C) STRAND ED NESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GCGCCATGGA CATTGACCCT TATA 24 
(2) INFORMATION FOR SEQ ID NO: 47: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 nucleotides 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
AGTGAAGCTT CCCACCTT 18 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

ATG GGG ACG AAT CTT TCT GTT CCC AAT CCT CTG GGA TTC TTT CCC GAT 48 
Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

CAT GAG TTG GAC CCT GCA TTC GGA GCC AAC TCA AAC AAT CCA GAT TGG 96 
His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

GAC TTC AAC CCC ATC AAG GAC CAC TGG CCA GCA GCC AAC CAG GTA GGA 144 
Asp Phe Asn Pro lie Lys Asp His Trp Pro Ala Ala Asn Gin Val Gly 
35 40 45 

GTG GGA GCA TTC GGG CCA GGG TTC ACC CCT CCA CAC GGC GGT GTT TTG 192 
Val Gly Ala Phe Gly Pro Gly Phe Thr Pro Pro His Gly Gly Val Leu 
50 55 60 

GGG TGG AGC CCT CAG GCT CAG GGC ATA TTG ACC ACA GTG TCA ACA ATT 240 
Gly Trp Ser Pro Gin Ala Gin Gly lie Leu Thr Thr Val Ser Thr He 
65 70 75 80 

CCT CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA AGG CAG CCT ACT CCC 288 
Pro Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro 
85 90 95 
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ATC TCT CCA CCT CTA AGA GAC AGT CAT CCT CAG GCC ATG GAC ATT GAC 336 
lie Ser Pro Pro Leu Arg Asp Ser His Pro Gin Ala Met Asp He Asp 
100 105 HO 

CCT TAT AAA GAA TTT GGA GCT ACT GTG GAG TTA CTC TCG TTT TTG CCT 384 
Pro Tyr Lye Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu Pro 
115 120 125 

TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC ACC GCC TCA GCT 432 
Ser Aap Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser Ala 
130 135 140 

CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC TCA CCT CAC CAC 480 
Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His 
"5 ISO 155 160 

ACC GGA CTC AGG GAA GCC ATT CTC TGC TGG GGG GAA TTG ATG ACT CTA 528 
Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr Leu 
165 170 175 

GCT ACC TGG GTG GGT AAT AAT TTG GAG GAT CCA GGA TCA AGG GAT CTA 576 
Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala Ser Arg Asp Leu 
180 185 190 

GTA GTC AAT TAT GTT AAT ACT AAC ATG GGT TTA AAA ATT AGG CAA CTA 624 
Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys lie Arg Gin Leu 
195 200 205 

TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA GAG ACT GTA CTT 672 
Leu Trp Phe His He Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu 
210 215 220 

GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT CCT CCA GCC TAT 720 
Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala Tyr 
225 230 235 240 

AGA CCA CCA AAT GCC CCT ATC TTA TCA AGA CTT CCG GAA ACT ACT GTT 768 
Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr Val 
245 250 255 

GTT AGA CGA CGG GAC GGA GGC AGG TCC CCT AGA AGA AGA ACT CCC TCG 816 
Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser 
260 265 270 

CCT CGC AGA CGC AGA TCC CAA TCG CCG GGT CGC AGA CGA TCT CAA TCT 664 
Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser 
275 280 285 

CGG GAA TCT CAA TGT TAG 882 
Arg Glu Ser Gin Cys 
290 

(2) INFORMATION FOR SEQ ID NO: 49: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 
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Asp Phe Asn Pro He Lys Asp His Trp Pro Ala Ala Asn Gin Val Gly 
35 40 45 

Val Gly Ala Phe Gly Pro Gly . Phe Thr Pro Pro His Gly Gly Val Leu 
50 55 60 

Gly Trp Ser Pro Gin Ala Gin Gly -He Leu Thr Thr Val Ser Thr He 
65 70 75 80 

Pro Pro Pro Ala Ser Thr Asn Arg Gin Ser Gly Arg Gin Pro Thr Pro 
85 90 95 

, He Ser Pro Pro Leu Arg Asp Ser His Pro Gin Ala Met Asp lie Asp 
1O0 105 110 

Pro Tyr Lys Glu Phe Gly Ala Thr Val Glu Leu Leu Ser Phe Leu Pro 
115 120 125 

Ser Asp Phe Phe Pro Ser Val Arg Asp Leu Leu Asp Thr Ala Ser Ala 
130 135 140 

Leu Tyr Arg Glu Ala Leu Glu Ser Pro Glu His Cys Ser Pro His His 
145 150 155 160 

Thr Ala Leu Arg Gin Ala He Leu Cys Trp Gly Glu Leu Met Thr Leu 
165 170 175 

Ala Thr Trp Val Gly Asn Asn Leu Glu Asp Pro Ala Ser Arg Asp Leu 
180 185 190 

Val Val Asn Tyr Val Asn Thr Asn Met Gly Leu Lys He Arg Gin Leu 
195 200 205 

Leu Trp Phe His lie Ser Cys Leu Thr Phe Gly Arg Glu Thr Val Leu 
210 215 220 

Glu Tyr Leu Val Ser Phe Gly Val Trp He Arg Thr Pro Pro Ala Tyr 
225 230 235 240 

Arg Pro Pro Asn Ala Pro He Leu Ser Thr Leu Pro Glu Thr Thr Val 
245 250 255 

Val Arg Arg Arg Asp Arg Gly Arg Ser Pro Arg Arg Arg Thr Pro Ser 
260 265 270 

Pro Arg Arg Arg Arg Ser Gin Ser Pro Arg Arg Arg Arg Ser Gin Ser 
275 280 285 

Arg Glu Ser Gin Cys 
290 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) HOLECULE TYPE: DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GTTGGTAGAA TTCCAATTAT 20 
(2) INFORMATION FOR SEQ ID NO: 51: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 
AAGAATGATC GATACAGTTT 
(2) INFORMATION FOR SEQ ID NO: 52s 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52 
GTACCCCGAA TTCATACTTA 
(2) INFORMATION FOR SEQ ID NO: 53s 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
AATAATATAT CGATAATTGT 
' (2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(it) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GTTGGAATTC CGCTACTGAT 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

TAACCAAGTA TCGATATAAT 20 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 4 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Arg Pro Thr Ser 
1 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Asn Ser Gly Leu Leu Val Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5B: 
CTCGATGTCG ACTAGCCATA 20 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
AAGTTGTCGA CCTTATGAGT 
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WHAT IS CLAIMED IS: 



1- A recombinant replicable vaccinia 



virus, the genome of which comprises: 



5 



(a) a first nucleotide sequence encoding at 
least one epitope of a hepatitis B 
virus surface antigen; 



(b) a first promoter sequence operably 
linked to said first nucleotide 



XO 



sequence ; 

a second nucleotide sequence encoding 
at least one epitope of a hepatitis B 
virus core antigen; and 
a second promoter sequence operably 
linked to said second nucleotide 



(c) 



(d) 



15 



sequence ; 

whereby the first and second nucleotide sequences are 
expressed by the virus in a suitable host. 



nucleotide sequence encodes an amino acid sequence 
comprising a protein selected from the group 
consisting of a hepatitis B virus S, MS, LS, preS2, 
preSl, derivatives of any of the foregoing displaying 

25 the antigenicity of any of the foregoing, and fusion 
proteins consisting of combinations of S, MS, LS, 
preS2, preSl, and derivatives of any of the foregoing 
displaying the antigenicity of any of the foregoing; 
and the second nucleotide sequence encodes an amino 

30 acid sequence comprising a protein selected from the 
group consisting of core antigen, e antigen, and 
derivatives thereof displaying the antigenicity of 
core antigen or e antigen. 



20 



2. 



The virus of claim 1 in which the first 



35 
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3 . The virus of claim 1 which is 
infectious in the host without causing significant 
disease. 

5 4. A recombinant replicable vaccinia 

virus, the genome of which comprises: 

• (a) a first nucleotide sequence encoding 
hepatitis B virus LS, MS, S, or a 
derivative of any of the foregoing 
10 displaying the antigenicity of any of 

the f or ego ing ; 
(b) a first promoter sequence operably 
linked to said first nucleotide 
sequence ; 

15 (c) a second nucleotide sequence encoding a 

fusion protein comprising a first 
protein sequence containing at least 
one hepatitis B virus surface antigen 
epitope fused via a peptide bond to a 
20 second protein sequence containing at 

least one hepatitis B virus core 
antigen epitope; and 
(d) a second promoter sequence operably 
linked to said second nucleotide 
25 sequence; 

whereby the first and second nucleotide sequences are 
expressed by the virus in a suitable host. 

5. The virus of claim 4 in which the 
30 fusion protein is selected from the group consisting 
of the following hepatitis B virus sequences: a preSl 
region fused to a core antigen, a preSl region fused 
to a core antigen derivative, a- core antigen fused to 
S, a core antigen fused to a preS2 region, a core 
35 antigen derivative fused to S, a core antigen 
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derivative fused to a preS2 region, a core antigen 
fused to a preSl region, a core antigen derivative 
fused to a preSl region, and a core antigen fused to a 
preSl region derivative; in which the core antigen 
5 derivative displays the antigenicity of the 

hepatitis B virus core antigen, and the preSl region 
derivative displays the antigenicity of the 
hepatitis B virus preSl region. 

10 6. The virus of claim 4 in which the 

fusion protein is selected from the group consisting 
of a core-preSl fragment fusion protein having the 
amino acid sequence shown in Figure 44 (SEQ ID NO: 26), 
a core-preS2 fusion protein having the amino acid 

15 sequence shown in Figure 46 (SEQ ID NO:30), a preSl- 
coreA8 fusion protein having the amino acid sequence 
shown in Figure 76 (SEQ ID NO:45) , and a preSl-core 
fusion protein having the amino acid sequence shown in 
Figure 78 (SEQ ID NO:49) . 

20 

7, The virus of claim 6 in which the first 
nucleotide sequence comprises the sequence encoding MS 
as shown in Figure 23 (SEQ ID NO:7). 

25 8. The virus of claim 6 in which the first 

nucleotide sequence encodes said derivative, said 
derivative being a fusion protein comprising a third 
protein sequence containing at least one hepatitis B 
virus surface antigen epitope, which third protein 

3 0 sequence is fused via a peptide bond to a fourth 

protein sequence containing at least one hepatitis B 
virus core antigen epitope; with the proviso that the 
first nucleotide sequence and the second nucleotide 
sequence have sequences that do not overlap by more 

35 than 20 nucleotides. 
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9- A recombinant: replicable vaccinia 
virus, the genome of which comprises: 

(a) a first; nucleotide sequence encoding a 
hepatitis B virus MS antigen; 
5 (b) a first promoter sequence operably 

linked to said first nucleotide 
sequence; 

(c) a second nucleotide sequence encoding a 
fusion protein comprising a hepatitis B 

10 virus core antigen or a derivative 

thereof displaying core antigenicity, 
fused at its carboxy terminus via a 
peptide bond to the amino terminus of a 
hepatitis B virus preSl region; and 

15 (d) a second promoter sequence operably 

linked to said second nucleotide 
sequence; 

whereby the first and second nucleotide sequences are 
expressed by the virus in a suitable host* 

20 

10. A recombinant replicable vaccinia 
virus, the genome of which comprises: 

(a) a first nucleotide sequence encoding a 
hepatitis B virus MS antigen; 
25 (b) a first promoter sequence operably 

linked to said first nucleotide 
sequence ; 

(c) a second nucleotide sequence encoding a 
fusion protein comprising a hepatitis B 
3 0 virus presi region fused at its carboxy 

terminus via a peptide bond to the 
amino terminus of a hepatitis B virus 
core antigen or "a derivative thereof 
displaying core antigenicity; and 

35 
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(d) a second promoter sequence operably 
linked to said second nucleotide 
sequence ; 

whereby the first and second nucleotide sequences are 
5 expressed by the virus in a suitable host. 

11. The virus of claim 9 in which the core 
antigen or derivative thereof is a core antigen 
derivative having the amino acid sequence shown in 

10 Figure 33 (SEQ ID NO: 15) or in Figure 30 (SEQ ID 
NO: 13) . 

12. The virus of claim 10 in which the core 
antigen or derivative thereof is a core antigen 

15 derivative having the amino acid sequence shown in 
Figure 33 (SEQ ID NO: 15) or in Figure 30 (SEQ ID 
NO: 13) . 

13. The virus of claim 12 in which the 
20 first promoter is modified p7.5, and the second 

promoter is p7.5. 

14. The virus of claim 12 in which the 
first promoter is p7.5, and the second promoter is 

25 pll. 

15. A recombinant replicable vaccinia 
virus, the genome of which comprises: 

(a) a first nucleotide sequence encoding a 
30 first fusion protein comprising a first 

hepatitis B virus core antigen 
derivative displaying core antigenicity 
fused at its carboxy terminus via a 
peptide bond to the amino terminus of a 
35 hepatitis B virus preS2 region; 
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(b) a first promoter sequence operably 
linked to said first nucleotide ' 



sequence ; 



10 



15 



20 



25 



30 



(c) a second nucleotide sequence encoding a 
second fusion protein comprising a 
hepatitis B virus preSl region fused at 
its car boxy terminus via a peptide bond 
to the amino terminus of a second 
hepatitis B virus core antigen 
derivative displaying core 
antigenicity; and 

(d) a second promoter sequence operably 
linked to said second nucleotide 
sequence; 



with the proviso that the first and second nucleotide 
sequences have sequences that do not overlap by more 
than 20 nucleotides; whereby the first and second 
nucleotide sequences are expressed by the virus in a 
suitable host. 



16* The virus of claim 15 in which the core 



antigen or derivative thereof is a core antigen 
derivative having the amino acid sequence shown in 
Figure 33 (SEQ ID NO: 15) or in Figure 30 (SEQ ID 
NO:13) „ 

17. The virus of claim 16 in which the 
first promoter is modified p7.5, and the second 
promoter is p7.5. 



virus, the genome of which comprises a nucleotide 
sequence encoding a hepatitis B virus core antigen 
derivative having the amino acid sequence shown in 



18. A recombinant replicable vaccinia 



Figure 30 (SEQ ID 



NO: 13); and a promoter sequence 
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operably linked to said nucleotide sequence, whereby 
the nucleotide sequence is expressed by the virus in a 
suitable host. 

5 19, A recombinant replicable vaccinia 

virus, the genome of which comprises a nucleotide 
sequence encoding a fusion protein, which protein 
comprises a first protein sequence containing at least 
one hepatitis B virus core antigen epitope / wherein 

10 said first protein sequence is fused at its carboxy 
terminus via a peptide bond to a second protein 
sequence containing at least one hepatitis B virus 
surface antigen epitope; and a promoter sequence 
operably linked to said nucleotide sequence, whereby 

15 the nucleotide sequence is expressed by the virus in a 
suitable host. 

20. A recombinant replicable vaccinia 
virus, the genome of which comprises a nucleotide 

20 sequence encoding a fusion protein, which protein 

comprises a first protein sequence containing at least 
one hepatitis B virus surface antigen epitope, wherein 
said first protein sequence is fused at its carboxy 
terminus via a peptide bond to a second protein 

25 sequence containing at least one hepatitis B virus 

core antigen epitope; and a promoter sequence operably 
linked to said nucleotide sequence,, whereby the 
nucleotide sequence is expressed by the virus in a 
suitable host. 

30 

21. The virus of claim 19 in which the 
fusion protein is selected from the group consisting 
of the following hepatitis B virus sequences: a core 
antigen or core antigen derivative fused to a preSl 

35 region or preSl derivative, core antigen or core 
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antigen derivative fused to a preS2 region or preS2 
derivative, or a core antigen or core antigen 
derivative fused to an S protein; in which the core 
antigen derivative displays the antigenicity of the 
5 core antigen, the preSl derivative displays the 

antigenicity of the preSl region, the preS2 derivative 
displays the antigenicity of the preS2 region, and the 
s derivative displays the antigenicity of the S 
protein. 

10 

22. The virus of claim 1 in which the first 
and second nucleotide sequences are situated 
internally with respect to vaccinia virus thymidine 
kinase gene sequences, and the virus does not express 

15 a functional thymidine kinase. 

23. The virus of claim 4 in which the first 
and second nucleotide sequences are situated 
internally with respect to vaccinia virus thymidine 

20 kinase gene sequences, and the virus does not express 
a functional thymidine kinase. 

24. The virus of claim 9 in which the first' 
and second nucleotide sequences are' situated 

25 internally with respect to vaccinia virus thymidine 
kinase gene sequences, and the virus does not express 
a functional thymidine kinase. 

25. The virus of claim 10 in which the 
30 first and second nucleotide sequences are situated 

internally with respect to vaccinia virus thymidine 
kinase gene sequences, and the virus does not express 
a functional thymidine kinase. 
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26. The virus of claim 19 in which the 
nucleotide sequence is situated internally with 
respect to vaccinia virus thymidine kinase gene 
sequences, and the virus does not express a functional 

5 thymidine kinase. 

27. Plasmid pHTL-31, as deposited with the 
ATCC and assigned accession number 69124. 

28. A recombinant replicable vaccinia 
virus, the genome of which comprises: 

(a) a first nucleotide sequence that 
comprises the p7.5 promoter operably 
linked to a first coding sequence 
encoding a fusion protein consisting of 
a hepatitis B virus preSl region fused 
to a hepatitis B virus core antigen 
derivative, as contained in plasmid 
pHTL-31; and 

(b) a second nucleotide sequence that 
comprises (i) the modified p7.5 
promoter operably linked to a second 
coding sequence encoding a hepatitis B 
virus MS antigen, and (ii) a sequence 
upstream of the initiator ATG of the MS 
antigen that is associated with strong 
ribosome binding, as contained in 
plasmid pHTL-3 1 ; 

wherein plasmid pHTL-31 is as deposited with the ATCC 
and assigned accession no. 69124. 

29. The virus of claim 1 which further 
comprises a sequence upstream of the initiator ATG in 
the first nucleotide sequence, that matches a 
35 
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consensus sequence associated with strong ribosoroe 
binding . 

30. The virus of claim 9 which further 
5 comprises a sequence upstream of the initiator ATG for 
the MS antigen in the first nucleotide sequence, that 
matches a consensus sequence associated with strong 
ribosoroe binding. 

10 31- The virus of claim 10 which further 

comprises a sequence upstream of the initiator ATG for 
the MS antigen in the first nucleotide sequence, that 
. matches a consensus sequence associated with strong 
ribosome binding. 

15 

32. A vaccine formulation comprising the 
recombinant vaccinia virus of claim 1; and a 
pharmaceutically acceptable carrier. 

20 33. A vaccine formulation comprising the 

recombinant vaccinia virus of claim 4; and a 
pharmaceutically acceptable carrier. 



34. A vaccine formulation comprising the 
25 recombinant vaccinia virus of claim 9; and a 

pharmaceutically acceptable carrier. 

35. A vaccine formulation comprising the 
recombinant vaccinia virus of claim 10; and a 

30 pharmaceutically acceptable carrier. 

36. A vaccine formulation comprising the 
recombinant vaccinia virus of claim 15; and a 
pharmaceutically acceptable carrier. 

35 
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37. A vaccine formulation comprising a 
first recombinant virus of claim 1; a second 
recombinant virus of claim 1; and a pharmaceutical ly 
acceptable carrier; in which the first and second 

5 recombinant viruses encode different hepatitis B virus 
amino acid sequences. 

38. A vaccine formulation comprising a 
first recombinant virus of claim 4 ; a second 

10 recombinant virus of claim 4; and a pharmaceutically 
acceptable carrier; in which the first and second 
recombinant viruses encode different hepatitis B virus 
amino acid sequences. 

15 39. A vaccine formulation comprising a 

first recombinant virus of claim 19; a second 
recombinant virus of claim 19; and a pharmaceutically 
acceptable carrier; in which the first and second 
recombinant viruses express different fusion proteins. 

20 

40. A vaccine formulation comprising: 
(a) a first recombinant replicable vaccinia 
virus, the genome of which comprises 
(i) a first nucleotide sequence 

25 encoding a first fusion protein, said 

first fusion protein comprising a 
hepatitis B virus core antigen or 
derivative thereof displaying core 
antigenicity, fused at its carboxy 

30 terminus via a peptide bind to a 

hepatitis B virus preSl region or 
derivative thereof displaying preSl 
antigenicity, and (ii) a first promoter 
sequence operably linked to said first 

35 nucleotide sequence, whereby the first 
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nucleotide sequence is expressed by 
said first virus in a suitable host; 
(b) a second recombinant replicable 

vaccinia virus , the genome of which 
5 comprises (i) a second nucleotide 

sequence encoding a second fusion 
protein, said second fusion protein 
comprising a hepatitis B virus core 
antigen or derivative thereof 

XO displaying core antigenicity, fused at 

its carboxy terminus via a peptide bind 
to a hepatitis B virus preS2 region or 
derivative thereof displaying preS2 
antigenicity, and (ii) a second 

15 promoter sequence operably linked to 

said second nucleotide sequence, 
whereby the second nucleotide sequence 
is expressed by said second virus in a 
suitable host; 

20 (c) a third recombinant replicable vaccinia 

virus, the genome of which comprises 
(i) a third nucleotide sequence 
encoding a hepatitis B virus S protein 
or a derivative thereof displaying S 

25 antigenicity, and (ii) a third promoter 

sequence operably linked to said third 
nucleotide sequence, whereby the third 
nucleotide sequence is expressed by 
said third virus in a suitable host; 

30 and 

(d) a pharmaceutical ly acceptable carrier, 

41. The vaccine formulation of claim 40 in 
which the first fusion protein has the amino acid 
35 sequence (SEQ ID NO:26) depicted in Figure 44, the 
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first, promoter is modified p7.5, the second fusion 
protein has the amino acid sequence (SEQ ID NO: 30) 
depicted in Figure 46, the second promoter is modified 
p7.5, the third nucleotide sequence encodes a protein 
5 having the amino acid sequence (SEQ ID NO: 10) depicted 
in Figure 24, and the third promoter is modified p.7.5. 



42. A vaccine formulation comprising: 
(a) a first recombinant replicable vaccinia 
virus, the genome of which comprises 
(i) a first nucleotide sequence 
encoding a first fusion protein, said 
first fusion protein comprising a 
hepatitis B virus core antigen or 
derivative thereof displaying core 
antigenicity, fused at its carboxy 
terminus via a peptide bind to a 
hepatitis B virus preSl region or 
derivative thereof displaying preSl 
antigenicity, and (ii) a first promoter 
sequence operably linked to said first 
nucleotide sequence, whereby the first 
nucleotide sequence is expressed by 
said first virus ih a suitable host; 
(b) a second recombinant replicable 

vaccinia virus, the genome of which 
comprises (i) a second nucleotide 
sequence encoding a hepatitis B virus 
MS protein or a derivative thereof 
displaying MS antigenicity, and (ii) a 
second promoter operably linked to said 
second nucleotide sequence, whereby the 
second nucleotide sequence is expressed 
by said second virus in a suitable 



35 



host; and 
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(c) a pharmaceutical ly acceptable carrier. 

43. The vaccine formulation of claim 42 in 
which the first fusion protein has the amino acid 

5 seguence (SEQ ID NO: 26) depicted in Figure 44 , the 

first promoter is modified p7.5, the second nucleotide 
sequence encodes an MS protein having the seguence 
(SEQ ID NO: 8) depicted in Figure 23, and in which a 
sequence upstream of the second nucleotide sequence 
10 matches a consensus sequence associated with strong 
ribosome binding. 

44. A vaccine formulation comprising: 

(a) a first recombinant repli cable vaccinia 

15 virus, the genome of which comprises 

(i) a first nucleotide sequence 
encoding a first fusion protein, said 
first fusion protein comprising a 
hepatitis B virus core antigen or 

20 derivative thereof displaying core 

antigenicity, fused at its carboxy 
terminus via a peptide bind to a 
hepatitis B virus preS2 region or 
derivative thereof displaying preS2 

25 antigenicity, and (ii) a first promoter 

sequence operably linked to said first 
nucleotide sequence, whereby the first 
nucleotide sequence is expressed by 
said first virus in a suitable host; 

30 (b) a second recombinant replicable 

vaccinia virus, the genome of which 
comprises (i) a second nucleotide 
sequence encoding a hepatitis B virus S 
protein or a derivative thereof 

35 displaying S antigenicity, and (ii) a 
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second promoter sequence operably 
linked to said second nucleotide 
sequence, whereby the second nucleotide 
sequence is expressed by said second 
5 virus in a suitable host; and 

(c) a pharmaceutically acceptable carrier. 

45. The vaccine formulation of claim 44 in 
which the first fusion protein has the amino acid 
sequence (SEQ ID NO: 30) depicted in Figure 46, the 
first promoter is modified p7.5, the second nucleotide 
sequence encodes a protein having the amino acid 
sequence (SEQ ID NO: 10) depicted in Figure 24, and the 
second promoter is modified p7.5. 

46. A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 
infection in a subject comprising administering to the 
subject an effective amount of the recombinant virus 
of claim 1. 

47 . A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 
infection in a subject comprising administering to the 

25 subject an effective amount of the recombinant virus 
of claim 4. 

48. A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 

30 infection in a subject comprising administering to the 
subject an effective amount of the recombinant virus 
of claim 9. 

49. A method of preventing hepatitis or 
35 other undesirable consequence of hepatitis B virus 



15 
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infection in a subject, comprising administering to the 
subject an effective amount of the recombinant virus 
of claim 10. 

50. A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 
infection in a subject comprising administering to the 
subject an effective amount of the recombinant virus 
of claim 15. 



10 



51. A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 
infection in a subject comprising administering to the 
subject an effective amount of the vaccine formulation 

15 of claim 39. 

52. A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 
infection in a subject comprising administering to the 

20 subject an effective amount of the vaccine formulation 
of claim 40. 

53. A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 

25 infection in a subject comprising administering to the 
subject an effective amount of: 

(a) a first recombinant replicable vaccinia 
virus, the genome of which comprises 
(i) a first nucleotide sequence 

30 encoding a first fusion protein, said 

first fusion protein comprising a 
hepatitis B virus core antigen or 
derivative thereof displaying core 
antigenicity, fused at its carboxy 

35 terminus via a peptide bind to a 
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hepatitis B virus preSl region or 
derivative thereof displaying preSl 
antigenicity, and (ii) a first promoter 
sequence operably linked to said first 
nucleotide sequence, whereby the first 
nucleotide sequence is expressed by 
said first virus in a suitable host; 
a second recombinant replicable 
vaccinia virus, the genome of which 
comprises (i) a second nucleotide 
sequence encoding a second fusion 
protein, said second fusion protein 
comprising a hepatitis B virus core 
antigen or derivative thereof 
displaying core antigenicity, fused at 
its carboxy terminus via a peptide bind 
to a hepatitis B virus preS2 region or 
derivative thereof displaying preS2 
antigenicity, and (ii) a second 
promoter sequence operably linked to 
said second nucleotide seguence, 
whereby the second nucleotide sequence 
is expressed by said second virus in a 
suitable host; and 

a third recombinant replicable vaccinia 
virus, the genome of which comprises 
(i) a third nucleotide sequence 
encoding a hepatitis B virus S protein 
or a derivative thereof displaying S 
antigenicity, and (ii) a third promoter 
sequence operably linked to said third 
nucleotide seguence, whereby the third 
nucleotide sequence is expressed by 
said third virus in a suitable host. 
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54 . A method of preventing hepatitis or 
other undesirable consequence of hepatitis B virus 
infection in a subject comprising administering to the 
subject an effective amount of: 
5 (a) a first recombinant replicable vaccinia 

virus, the genome of which comprises 
(i) a first nucleotide sequence 
encoding a first fusion protein, said 
first fusion protein comprising a 
10 hepatitis B virus core antigen or 

derivative thereof displaying core 
antigenicity, fused at its carboxy 
terminus via a peptide bind to a 
hepatitis B virus preSl region or 
15 derivative thereof displaying preSl 

antigenicity, and (ii) a first promoter 
sequence operably linked to said first 
nucleotide sequence, whereby the first 
nucleotide sequence is expressed by 
2 0 said first virus in a suitable host; 

and 

(b) a second recombinant replicable 

vaccinia virus, the genome of which 
comprises (i) a second nucleotide 

25 sequence encoding a hepatitis B virus 

MS protein or a derivative thereof 
displaying MS antigenicity, and (ii) a 
second promoter operably linked to said 
second nucleotide sequence, whereby the 

30 second nucleotide sequence is expressed 

by said second virus in a suitable 
host. 

55, A method of preventing hepatitis or 
35 other undesirable consequence of hepatitis B virus 
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infection in a subject comprising administering to the 
subject an effective amount of: 

(a) a first recombinant replicable vaccinia 
virus, the genome of which comprises 
5 (i) a first nucleotide sequence 

encoding a first fusion protein , said 
first fusion protein comprising a 
hepatitis B virus core antigen or 
derivative thereof displaying core 
10 antigenicity, fused at its carboxy 

terminus via a peptide bind to a 
hepatitis B virus preS2 region or 
derivative thereof displaying preS2 
antigenicity, and (ii) a first promoter 
is sequence operably linked to said first 

nucleotide sequence, whereby the first 
nucleotide sequence is expressed by 
said first virus in a suitable host; 
and 

2 0 (b) a second recombinant replicable 

vaccinia virus, the genome of which 
comprises (i) a second nucleotide 
sequence encoding a hepatitis B virus S 
protein or a derivative thereof 

25 displaying S antigenicity, and (ii) a 

second promoter sequence operably 
linked to said second nucleotide 
sequence, whereby the second nucleotide 
sequence is expressed by said second 

30 virus in a suitable host. 

56. A method of treating hepatitis or other 
undesirable consequence of hepatitis B virus infection 
comprising administering to a subject in need of such 

35 
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treatment an effective amount of the recombinant 
vaccinia virus of claim 1* 

57. A method of treating hepatitis or other 
5 undesirable consequence of hepatitis B virus infection 
comprising administering to a subject in need of such 
treatment an effective amount of the recombinant 
vaccinia virus of claim 4. 

10 58 - A purified protein having the amino 

acid sequence (SEQ ID NO: 13) depicted in Figure 30. 

59. A purified nucleic acid encoding the 
protein of claim 58* 

15 
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| 10 | 20 | 30 | 40 | 50 | 60 
1 ATAAATAATA AATACAATAA TTAATTTCTC GTAAAAGTAG AAAATATATT CTAATTTATT 60 
61 GCAC 64 
| 10 | 20 | 30 | 40 | 50 | 60 



FIG.2 



| 10 | 20 | 30 | 40 | 50 | 60 
1 CTAGAAGCGA TGCTACGCTA GTCACAATCA CCACTTTCAT ATTTAGAATA TATGTATGTA 60 
61 AAAATATAGT AGAATTTCAT TTTGTTTTTT TCTATGCTAT AAAT 104 
| 10 | 20 | 30 | 40 | 50 | 60 



FIG.4 



| 10 | 20 | 30 | 40 | 50 | 60 
1 AAAAATTGAA ATTTTATTTT TTTTTTTTGG AATATAAATA AG 42 
| 10 | 20 | 30 | 40 | 50 | 60 
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1/1 31/11 

ATG GGG ACG MT CTT TCT GTT CCC Ml OCT CTG GGA TTC ITT CCC GAT CAT CAG TTG GAC 
met gly thr osn leu ser vol pro osn pro leu gly phe phe pro osp his gin leu asp 
61 / 21 91/31 

CCT GCA TTC GGA GCC AAC TCA Aoc AAT CCA GAT TGG GAC TTC AAC CCC ATC AAG GAC CAC 
pro olo phe gly olo osn ser osn osn pro osp trp osp phe osn pro ile lys osp his 
121 / 41 151 / 51 

TGG CCA GCA GCC AAC CAG GTA GGA GTG GGA GCA TTC GGG CCA GGG tTC ACC CCT CCA CAC 
trp pro olo olo osn gin vol gly vol gly. olo phe gly pro gly phe thr pro pro his 
181 / 61 211 / 71 

GGC GGT gTT TTG GGG TGG AGC CCT CAG GCT CAG GGC ATA TTG ACC ACA GTG TCA ACA ATT 
gly gly vol leu gly trp ser pro gin olo gin gly ile leu thr thr vol ser thr ile 
241 / 81 271 / 91 

CCT CCT CCT GCC TCC ACC AAT CGG CAG TCA GGA AGG CAG CCT ACT CCC ATC TCT CCA CCT 
pro pro pro olo ser thr osn org gin ser gly org gin pro thr pro ile ser pro pro 
301 / 101 331 / 111 

CTA AGA GAC ACT CAT CCT CAG GCC ATG CAG TGG AAc TCC ACT GCC TTc CAC CAA GCT CTG 
leu org osp ser his pro gin olo met gin trp osn ser thr olo phe his gin olo leu 
361 / 121 391 / 131 

CAG GAT CCC AGA GTC AGG GGT CTG TAT tTT CCT GCT GGT GGC TCC ACT TCA GGA ACA GTA 
gin osp pro org vol org giy leu tyr phe pro olo gly gly ser ser ser gly thr vol 
421 / 141 451 / 151 

AAC CCT GCT CCG AAT ATT GCC TCT CAC ATC TGG TCA ATC TCC GCG AGG ACT GGG GAC CCT 
osn pro olo pro osn ile olo ser his ile ser ser ile ser olo org thr gly osp pro 
481 / 161 511 / 171 

GTG ACG AoC ATG GAG AAC ATC ACA TCA GCA TTC CTA GGA CCC CTG CTC GTG TTA CAG GCG 
vol thr osn met glu osn ile thr ser gly phe leu gly pro leu leu vol leu gin olo 
541 / 181 571 / 191 

GGG TTT TTC TTG TTG ACA AGA ATC CTC ACA ATA CCG CAG AGT CTA GAC TCC TGG TGG ACT 
gly phe phe leu leu thr org ile leu thr ile pro gin ser leu osp ser trp trp thr 
601 / 201 631 / 211 

TCT CTC AAT TTT CTA GGG GGA TCA CCC GTG TGT CTT GGC CAA AAT TCG CAG TCC CCA ACC 
ser leu osn phe leu gly gly ser pro vol cys leu gly gin osn ser gin ser pro thr 
661 / 221 691 / 231 

TCC AAT CAC TCA CCA ACC TCC TGT CCT CCA ATT TGT CCT GGT TAT CGC TGG ATG TGT CTG 
ser osn his ser pro thr ser cys pro pro ile cys pro gly tyr org trp met cys leu 
721 / 241 751 / 251 

CGG CGT TTT ATC ATA TTC CTC TTC ATC CTG CTG CTA TGC CTC ATC TTC TTA TTG GTT CTT 
org org phe ile ile phe leu phe ile leu leu leu cys leu ile phe leu leu vol leu 
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leu osp tyr gin gly met leu pro vol cys pro leu ile pro gly ser thr thr thr ser 
841 / 281 871 / 291 
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thr gly pro cys lys thr cys thr thr pro olo gin gly osn ser net phe pro ser cys 
901 / 301 931 / 311 

TGC TGT ACA AAA CCT ACG GAT GGA AAT TGC ACC TGT ATT CCC ATC CCA TCG TCI TGG. GCT 
cys cys thr lys pro thr osp gly osn cys thr cys ile pro ile pro ser ser trp olo 
961 / 321 991 / S3 1 

TTC GCA AAA TAC CTA TGG GAG TGG GCC TCA GTC CGT TTC TCT TGG CTC AGT TTA CTA GTG 
phe olo lys tyr leu trp glu trp olo ser vol org phe ser trp leu ser leu leu vol 
1021 / 341 1051 / 351 

CCA TTT GTT CAG TGG TTC GTA GGG CTT TCC CCC ACT GTT TGG CTT TCA GCT ATA TGG ATG 
pro phe vol gin trp phe vol gly leu ser pro thr vol trp leu ser olo ile trp met 
1081 / 361 mi / 371 

ATG TGC TAT TGG GGG CCA AGT CTG TAC AGC ATC GTG AGT CCC TTT ATA CCC CTG TTA CCA 
met trp tyr trp gly pro ser leu tyr ser ile vol ser pro phe ile pro leu leu pro 
1141 / 381 

ATT TTC TTT TGT CTC TGG GTA TAC ATT TAA 
ile phe phe cys leu trp vol tyr ile OCH 
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ATG CAG TGG AAC TCC ACT GCC TTC CAC CAA GCT CTG CAG GAT CCC AGA GTC AGG GGT CTG 
net gin trp osn ser thr olo phe his gin olo leu gin osp pro org vol org gly leu 
61 / 21 91 / 31 

TAT TTT CCT GCT GGT GGC TCC ACT TCA GGA ACA GTA AAC CCT GCT CCG AAT ATT GCC TCT 
tyr phe pro olo gly gly ser ser ser gly thr vol osn pro olo pro osn ile olo ser 
121 / 41 151 / 51 

CAC ATC TOG TCA ATC TCC GCG AGG ACT GGG GAC CCT GTG ACG AAC ATG GAG AAC ATC ACA 
his ile ser ser ile ser olo org thr gly osp pro vol thr osn met glu osn ile thr 
181 / 61 211 / 71 

TCA GGA TTC CTA GGA CCC CTG CTC GTG TTA CAG GCG GGG TTT TTC TTG TTG ACA AGA ATC 
ser gly phe leu gly pro leu leu vol leu gin olo gly phe phe leu leu thr org ile 
241 / 81 271 / 91 

CTC ACA ATA CCG CAG AGT CTA GAC TCG TGG TGG ACT TCT CTC AAT TTT CTA GGG GGA TCA 
leu thr ile pro gin ser leu osp ser Irp Irp thr ser leu osn phe leu gly gly ser 
301 / 101 331 / 111 

CCC GTG TCT CTT GGC CAA AAT TCG CAG TCC CCA ACC TCC AAT CAC TCA CCA ACC TCC TGT 
pro vol cys leu gly gin osn ser gin ser pro thr ser osn his ser pro thr ser cys 
361 / 121 391 / 131 

CCT CCA ATT TGT CCT GGT TAT CGC TGG ATG TGT CTG GGG GGT TTT ATC ATA TTC CTC TTC 
pro pro ile cys pro gly tyr org trp met cys leu org org phe ile ile phe leu phe 
421 / 141 451 / 151 

ATC CTG CTG CTA TGC CTC ATC TTC TTA TTG GTT CTT CTG GAT TAT CAA GGT ATG TTG CCC 
ile leu leu leu cys leu ile phe leu leu vol leu leu osp tyr gin gly met leu pro 
481 / 161 511 / 171 

GTT TGT CCT CTA ATT CCA GGA TCA ACA ACA ACC AGT ACG GGA CCA TGC AAA ACC TGC ACG 
vol cys pro leu ile pro gly ser thr thr thr ser thr gly pro cys lys thr cys thr 
541 / 181 571 / 191 

ACT CCT GCT CAA GGC AAC TCT ATG TTT CCC TCA TGT TGC TGT ACA AAA CCT ACG GAT GGA 
thr pro olo gin gly osn ser net phe pro ser cys cys cys thr lys pro thr osp gly 
601 / 201 631 / 211 

AAT TGC ACC TGT ATT CCC ATC CCA TCG TCT TGG GCT TTC GCA AAA TAC CTA TGG GAG TGG 
osn cys thr cys ile pro ile pro ser ser Irp olo phe olo lys tyr leu trp glu trp 
661 / 221 691 / 231 

GCC TCA GTC CGT TTC TCT TGG CTC AGT TTA CTA GTG CCA TTT GTT CAG TGG TTC GTA GGG 
olo ser vol org phe ser trp leu ser leu leu vol pro phe vol gin trp phe vol gly 
721 / 241 751 / 251 

CTT TCC CCC ACT GTT TGG CTT TCA GCT ATA TGG ATG ATG TGG TAT TGG GGG CCA AGT CTG 
leu ser pro thr vol trp leu ser olo ile trp met met trp tyr trp gly pro ser leu 
781 / 261 811 / 271 

TAC ACC ATC GTC AGT CCC TTT ATA CCG CTG TTA CCA ATT TTC TTT TGT CTC TGG GTA TAC 
tyr ser ile vol ser pro phe ile pro leu leu pro ile phe phe cys leu trp vol tyr 
841 / 281 
ATT TAA 
ile OCH 
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1/1 31/11 

ATG GGG ACG AAT CTT TCT GTT CCC AAt CCT CTG GGA TTC ITT CCC GAT CAT CAG TTG CAC 
me I gly thr osn leu ser vol pro osn pro leu gly phe phe pro osp his gin leu osp 
61 / 21 91 / 31 

CCT GCA TTC GGA GCC AAC TCA Aoc AAT CCA GAT TGG GAC TTC AAC CCC ATC AAG GAC CAC 
pro alo phe gly olo osn ser osn osn pro osp trp osp phe osn pro ile lys asp his 
121 / 41 151 / 51 

TGG CCA GCA GCC AAC CAG GTA GGA GTG. GGA GCA TTC GGG CCA GGG tTC ACC CCT CCA CAC 
trp pro ola olo osn gin vol gly vol gly olo phe gly pro gly phe thr pro pro his 
181 / 61 211 / 71 

GGC GGT gTT TTG GGG TGG AGC CCT CAG GCT CAG GGC ATA TTG ACC ACA GTG TCA ACA ATT 
gly gly vol leu gly trp ser pro gin olo gin gly ile leu thr thr vol ser thr ile 
241 / 81 271 / 91 

CCT CCT CCT GCC TCC ACQ AAT CGG CAG TCA GGA AGG CAG CCT ACT CCC ATC TCT CCA CCT 
pro pro pro olo ser thr osn org gin ser gly org gin pro thr pro ile ser pro pro 
301 / 101 331 / 111 

CTA AGA GAC AGT CAT CCT CAG GCC ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT 
leu org osp ser his pro gin olo met osp ile osp pro tyr lys glu phe gly olo thr 
361 / 121 391 / 131 

GTG GAG TTA CTC TCG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 
vol glu leu leu ser phe leu pro ser asp phe phe pro ser vol org osp leu leu asp 
421 / 141 451 / 151 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC TCA CCT CAC CAC 
thr alo ser olo leu tyr org glu olo leu glu ser pro glu his cys ser pro his his 
481 / 161 511 / 171 

ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA TTG ATG ACT CTA GCT ACC TGG GTG 
thr alo leu org gin olo ile leu cys trp gly glu leu met thr leu alo thr trp vol 
541 / 181 571 / 191 

GGT AAT AAT TTG GAG GAT CCA GCA TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC 
gly osn osn leu glu osp pro olo ser org osp leu vol vol osn tyr vol osn thr osn 
601 / 201 631 / 211 

ATG GGT TTA AAA ATT AGG CAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 
met gly leu lys ile org gin leu leu trp phe his ile ser cys leu thr phe gly org 
661 / 221 691 / 231 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT CCT CCA GCC TAT 
glu thr vol leu glu tyr leu vol ser phe gly vol trp ile org thr pro pro alo tyr 
721 / 241 751 / 251 

AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG GAA ACT ACT GTT GTT AGA GGA CGG 
org pro pro osn olo pro ile leu ser thr leu pro glu thr thr vol vol org org org 
781 / 261 811 / 271 

GAC CGA GGC AGG TCC CCT AGA AGA AGA ACT CCC TCG CCT CGC AGA CGC AGA TCT CAA TCT 
asp org gly org ser pro org org org thr pro ser pro org org org org ser gin ser 
841 / 281 

CGG GAA TCT CAA TGT TAG 
org glu ser gin cys AMB 
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1/1 31/11 

Atg ggg ocg oat ctt tct gtt ccc ool cct ctg ggo ttc ttt ccc gat cat cag ttg goc 
net gly thr osn leu ser vol pro osh pro leu gly phe phe pro osp his gin leu osp 
61 / 21 91 / 31 

cct gco ttc ggo gcc ooc tco ooc oot ceo got tgg gac ttc ooc ccc otc oog goc coc 
pro olo phe gly olo osn ser osn osn pro osp trp osp phe osn pro ile lys asp his 
121 / 41 151 / 51 

tgg ceo gco gcc ooc cog gto ggo gtg ggo gco ttc ggg ceo ggg ttc acc cct ceo coc 
trp pro olo olo osn gin vol gly vol gly olo phe gly pro gly phe thr pro pro his 
181 / 61 211 / 71 

ggc ggt gtt ttg ggg tgg ogc cct cog gel cog ggc olo ttg occ oco gtg tco aco ott 
gly gly vol leu gly trp ser pro gin olo gin gly ile leu thr thr vol ser thr ile 
241 / 81 271 / 91 

cct cct cct gcc tec acc ool egg cog tco ggo agg cog cct act ccc otc tel ceo cct 
pro pro pro olo ser thr osn org gin ser gly org gin pro thr pro ile ser pro pro 
301 / 101 331 / 111 

cto ogo goc ogt cot cct cog gcc ATG GAC ATT GAC CCT TAT AAA GAA TTT GGA GCT ACT 
leu org osp ser his pro gin olo met osp ile osp pro tyr lys glu phe gly olo thr 
361 / 121 391 / 131 

GTG GAG TTA CTC TOG TTT TTG CCT TCT GAC TTC TTT CCT TCC GTC AGA GAT CTC CTA GAC 
vol glu leu leu ser phe leu pro ser osp phe phe pro ser vol org osp leu leu asp 
421 / 141 451 / 151 

ACC GCC TCA GCT CTG TAT CGG GAA GCC TTA GAG TCT CCT GAG CAT TGC TCA CCT CAC CAC 
thr olo ser olo leu tyr org glu olo leu glu ser pro glu his cys ser pro his his 
481 / 161 511 / 171 

ACC GCA CTC AGG CAA GCC ATT CTC TGC TGG GGG GAA TTG ATG ACT CTA GCT ACC TGG GTG 
thr olo leu org gin olo ile leu cys trp gly glu leu met thr leu olo thr trp vol 
541 / 181 571 / 191 

GGT AAT AAT TTG GAG GAT CCA GCA TCA AGG GAT CTA GTA GTC AAT TAT GTT AAT ACT AAC 
gly osn osn leu glu osp pro olo ser org osp leu vol vol osn tyr vol osn thr asn 
601 / 201 631 / 211 

ATG GGT TTA AAA ATT AGG CAA CTA TTG TGG TTT CAT ATA TCT TGC CTT ACT TTT GGA AGA 
met gly leu lys ile org gin leu leu Irp phe his ile ser cys leu Ihr phe gly org 
661 / 221 691 / 231 

GAG ACT GTA CTT GAA TAT TTG GTA TCT TTC GGA GTG TGG ATT CGC ACT CCT CCA GCC TAT 
glu thr vol leu glu tyr leu vol ser phe gly vol trp ile org thr pro pro olo tyr 
721 / 241 751 / 251 

AGA CCA CCA AAT GCC CCT ATC TTA TCA ACA CTT CCG GAA ACT ACT GTT GTT AGA CGA CGG 
org pro pro osn olo pro ile leu ser thr leu pro glu thr thr vol vol org org org 
781 / 261 811 / 271 

GAC CGA GGC AGG TCC CCT AGA AGA AGA ACT CCC TCG CCT CGC AGA CGC AGA tec coo teg 
osp org gly org ser pro org org org thr pro ser pro org org org org ser gin ser 
841 / 281 871 / 291 

ccg cgt cgc ago ego tct CAA TCT CGG GAA TCT CAA TGT TAG 
pro org org org org ser gin ser org glu ser gin cys AMB 
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